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Abstract 

We review results on the scaling of the optimal path length ^opt in random networks with 
weighted links or nodes. We refer to such networks as "weighted" or "disordered" networks. The 
optimal path is the path with minimum sum of the weights. In strong disorder, where the maximal 
weight along the path dominates the sum, we find that £opt increases dramatically compared to 
the known small world result for the minimum distance imin ~ log-/V, where N is the number of 
nodes. For Erdos-Renyi (ER) networks £opt ~ iV^/^, while for scale free (SF) networks, with degree 
distribution P{k) ~ k-^, we find that e^pt scales as Ar('^-3)/('^-i) for 3 < A < 4 and as A^V^ for 
A > 4. Thus, for these networks, the small-world nature is destroyed. For 2 < A < 3 in contrary, 
our numerical results suggest that £opt scales as ln^~^ N, representing still a small world. We also 
find numerically that for weak disorder £opt ~ IniV for ER models as well as for SF networks. We 
also review the transition between the strong and weak disorder regimes in the scaling properties 
of £opt for ER and SF networks and for a general distribution of weights r, P{t). For a weight 
distribution of the form P(r) = l/iar) with (rmin < t < Tmax) and a = lnTniax/7"mm, we find that 
there is a crossover network size A'^* = N*{a) at which the transition occurs. For N <ti N* the 
scaling behavior of £opt is in the strong disorder regime, while for N S> N* the scaling behavior is in 
the weak disorder regime. The value of N* can be determined from the expression iooi^*) = aPc, 
where ioo is the optimal path length in the limit of strong disorder, A = apc oo and pc is the 
percolation threshold of the network. We suggest that for any P(r) the distribution of optimal 
path lengths has a universal form which is controlled by the scaling parameter Z = £00/ A where 
A = PcTc/ Jq" TP{T)dT plays the role of the disorder strength and Tc is defined by Jq" P{T)dT = Pc- 
In case P(r) ~ l/(ar), the equation for A is reduced io A = apc- The relation for A is derived 
analytically and supported by numerical simulations for Erdos-Renyi and scale-free graphs. We also 
determine which form of P{t) can lead to strong disorder A — > 00. We then study the minimum 
spanning tree (MST), which is the subset of links of the network connecting all nodes of the network 
such that it minimizes the sum of their weights. We show that the minimum spanning tree (MST) 
in the strong disorder limit is composed of percolation clusters, which we regard as "super-nodes", 
interconnected by a scale-free tree. The MST is also considered to be the skeleton of the network 
where the main transport occurs. We furthermore show that the MST can be partitioned into two 
distinct components, having significantly different transport properties, characterized by centrality 
— number of times a node (or link) is used by transport paths. One component the superhighways, 
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for which the nodes (or hnks) with high centrahty dominate, corresponds to the largest cluster at 
the percolation threshold (incipient infinite percolation cluster) which is a subset of the MST. The 
other component, roads, includes the remaining nodes, low centrality nodes dominate. We find 
also that the distribution of the centrality for the incipient infinite percolation cluster satisfies a 
power law, with an exponent smaller than that for the entire MST. We demonstrate the significance 
identifying the superhighways by showing that one can improve significantly the global transport 
by improving a very small fraction of the network, the superhighways. 

PACS numbers: 89.75.Hc,89.20.Ff 

Keywords: minimum spanning tree, percolation, scale-free, optimization 
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I. INTRODUCTION 



Recently much attention has been focused on the topic of complex networks which char- 
acterize many biological, social, and communication systems 0,0,0]. The networks are 
represented by nodes associated to individuals, organizations, or computers and by links 
representing their interactions. The classical model for random networks is the Erdos-Renyi 
(ER) model 4,0,0]. An important quantity characterizing networks is the average distance 
(minimal hopping) £min between two nodes in the network of total nodes. For the Erdos- 
Renyi network fmin scales as InA^ Q], which leads to the concept of "small worlds" or "six 
degrees of separation". For scale- free (SF) [l] networks ^min scales as InlnA^, this leads to 
the concept of ultra small worlds ^ 0] . 

In most studies, all links in the network are regarded as identical and thus a crucial 
parameter for information flow including efficient routing, searching, and transport is irnw, . In 
practice, however, the weights (e.g., the quality or cost) of links are usually not equal [9|, |lO|. 

Thus the length of the optimal path iopt, minimizing the sum of weights, is usually longer 
than irain- For example, the cost could be the time required to transit the link. There are 
often many traffic routes from site A to site B with a set of transit time r,, associated 
with each link along the path. The fastest (optimal) path is the one for which J2i is a 
minimum, and often the optimal path has more links than the shortest path. In many cases, 
the selection of the path is controlled by most of the weights (e.g., total cost) contributing 
to the sum. This case corresponds to weak disorder (WD). However, in other cases, for 
example when the distribution of disorder is very broad a single weight dominates the sum. 
This situation — in which one link controls the selection of the path — is called the strong 
disorder limit (SD). 

For a recent quantitative criterion for SD and WD, see Ref j3] and Section IVir B) in this 
article. 

The strong disorder is relevant e.g. for computer and traffic networks, since the slowest 
link in communication networks determines the connection speed. An example for SD is 
when a transmission at a constant high rate is needed (e.g., in broadcasting video records 
over the Internet). In this case the narrowest band link in the path between the transmitter 
and receiver controls the rate of transmission. This limit is also called the "ultrametric" 
limit and we refer to the optimal path in this limit as the min-max path. 
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The SD limit is also related to the minimal spanning tree which includes all optimal paths 
between all pairs of sites in the network. The disorder on a network is usually implemented 



22 



2i 



y 



32l |. We assign to each 



from a distribution P{t) ~ l/(ar), where 1 < r < e" 
link of the network a random number r, uniformly distributed between and 1. The cost 
associated with link i is then Tj = exp(arj) where a is the parameter which controls the 
broadness of the distribution of link costs. The parameter a represents the strength of 
disorder. The limit a ^ cxd is the strong disorder limit, since for this case clearly only one 
link dominates the cost of the path. The strong disorder limit (SD) can be implemented in a 
disordered media by assigning to each link a potential barrier ej so that is the time to cross 
this barrier in a thermal activation process. Thus Tj = e'^^^^'^ , where K is the Boltzmann 
constant and T is absolute temperature. The optimal path corresponds to the minimum 
over all possible paths. We can define disorder strength a = 1/KT. When a ^ oo, 
only the largest Xj dominates the sum. Thus, T ^ (very low temperature) corresponds to 
the strong disorder limit. 

There are distinct scaling relationships between the length of the average optimal path 
£opt and the network size (number of nodes) depending on whether the network is strongly 



or weakly disordered 



30, 



32| . It was shown using percolation arguments (See Section HV|) 
opt ~ iV'^°p*, where i^opt = 1/3 for Erdos-Renyi (ER) random 



that for strong disorder ^ 

networks jj] and for scale-free (SF) \}\ networks with A > 4, where A is the exponent 
characterizing the power law decay of the degree distribution. For SF networks with 3 < 
A < 4, z/opt = (A — 3)/(A — 1). For 2 < A < 3, percolation arguments do not work, but 



In 



A^, which is again much larger than the ultra 

. When the 



the numerical results suggest iopt 

small result for the shortest path ij^^^ ~ InlniV found for 2 < A < 3 in Ref.j 
weights are taken from a uniform distribution we are in the weak disorder limit. In this case 
iopt ~ InA^ for both ER and SF for all the values of A 3^. For 2 < A < 3, this result is 
significantly different from the ultra small world result found for unweighed networks. 

Porto et al . [30] considered the optimal path transition from weak to strong disorder for 
2-D and 3-D lattices, and found a crossover in the scaling properties of the optimal path 
that depends on the disorder strength a, as well as the lattice size L (see also 23|). Similar 
to regular lattices, there exists for any finite a, a crossover network size A^* (a) such that for 

<^ N*{a), the scaling properties of the optimal path are in the strong disorder regime 
while for A^ ^ N*{a), the network is in the weak disorder regime. The function N*{a) 
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was evaluated. Moreover, a general criterion to determine which form of -P(t) can lead to 
strong disorder, and a general condition when strong or weak disorder occurs was found 
analytically 73]. The derivation was supported by extensive simulations. 

The study of the distribution of the length of the optimal paths in a network was re- 
ported in Ref [44 1. It was found that the distribution has the scaling form P{iopt, N,a) ~ 
j-G (j^, ^^)) where ioo is iopt for a 00 and pc is the percolation threshold. It was 
also shown that a single parameter Z = determines the functional form of the distri- 



bution. Importantly, it was found [7^ that for all P{t) that possess a strong-weak disorder 
crossover, the distributions P(£opt) of the optimal path lengths display the same universal 
behavior. 

Another interesting question is about a possible origin of scale-free degree distribution 
with A = 2.5 in some real world networks. Kalisky et al, [77| introduced a simple process that 
generates random scale-free networks with A = 2.5 from weighted Erdos-Renyi graphs P|. 
They found that the minimum spanning tree (MST) on an Erdos-Renyi graph is composed 
of percolation clusters, which we regard as "super nodes" , interconnected by a scale-free tree 
with A = 2.5. 

Known as the tree with the minimum weight among all possible spanning tree, the MST 
is also the union of all "strong disorder" optimal paths between any two nodes jlll . 14 , [2^ 
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As the global optimal tree, the MST plays a major role for transport process. 



which is widely used in different fields, such as the design and operation of communication 
networks, the travelling salesman problem, ;he protein interaction problem, optimal traffic 



flow, and economic networks 



One important question in network 



transport is how to identify the nodes or links that are more important than others. A 
relevant quantity that characterizes transport in networks is the betweenness centrality, C, 
which is the number of times a node (or link) used by all optimal paths between all pairs of 
For simplicity we call the "betweenness centrality" here "centrality" and 



nodes 
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we use the notation "nodes" but similar results have been obtained for links. The centrality, 
C, quantifies the "importance" of a node for transport in the network. Moreover, identifying 
the nodes with high C enables us to improve their transport capacity and thus improve the 
global transport in the network. The probability density function (pdf) of C was studied 



on the MST for both SF [61] and ER \m networks and found to satisfy a power law, 
Pmst(C) ~ C'^'^MST^ with (5mst close to 2 jsj, However, Ref 0] found that a sub- 
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network of the MST j59|, the infinite incipient percolation cluster (IIC) has a significantly 
higher average C than the entire MST — i.e., the set of nodes inside the IIC are typically 
used by transport paths more often than other nodes in the MST (See Section IIX|) . In this 
sense the IIC can be viewed as a set of superhighways (SHW) in the MST. The nodes on 
the MST which are not in the IIC are called roads, due to their analogy with roads which 
are not superhighways (usually used by local residents). Wu et al. j60| demonstrate the 
impact of this finding by showing that improving the capacity of the superhighways (IIC) 
is significantly a better strategy to enhance global transport compared to improving the 
same number of links of the highest C in the MST, although they have higher C Q]. This 
counterintuitive result shows the advantage of identifying the IIC subsystem, which is very 



small and of oder zero compared to the full network 
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II. ALGORITHMS 

A. Construction of the Networks 

To construct an ER network of size with average node degree (/c), we start with {k)N/2 
edges and randomly pick a pair of nodes from the total possible N{N — l)/2 pairs to connect 
with an edge. The only condition we impose is that there cannot be multiple edges between 
two nodes. When (k) > 1 almost all nodes of the network will be connected with high 
probability. 

To generate scale-free (SF) graphs of size N, we employ the Molloy-Reed algorithm : 
initially the degree of each node is chosen according to a scale-free distribution, where each 
node is given a number of open links or " stubs" according to its degree. Then, stubs from 
all nodes of the network are interconnected randomly to each other with two constraints 
that there are no multiple edges between two nodes and that there are no looped edges with 
identical ends. The exact form of the degree distribution is usually taken to be 

P{k) = ck-^ k = m,---K (1) 

where m and K are the minimal and maximal degrees, and c ~ (A — l)m^^^ is a normalization 
constant . For real networks with finite size, the highest degree K depends on network size 
N: K ^ mN^^^^~^\ thus creating a "natural" cutoff for the highest possible degree . When 
m > 1 there is a high probability that the network is fully connected. 



B. Dijkstra's algorithm 

The Dijkstra's algorithm is used in general to find the optimal path, when the weights 
are drawn from an arbitrary distribution. The search for the optimal path follows a procedure 
akin to "burning" where the "fire" starts from our chosen origin. At the beginning, all nodes 
are given a distance oo except the origin which is given a distance 0. At each step we choose 
the next unburned node which is nearest to the origin, and "burn" it, while updating the 
optimal distance to all its neighbors. The optimal distance of a neighbor is updated only if 
reaching it from the current burning node gives a total path length that is shorter than its 
current distance. 



C. Ultrametric Optimization 

Next we describe a numerical method for computing £opt between any two nodes in strong 



disorder 



22]. In this case the sum of the weights must be completely dominated by the 
largest weight. Sometimes this condition is referred as ultrametric. We can satisfy this 
condition assigning weights to all the links Tj = exp(arj) choosing a to be so large, that 
any two links will have different binary orders of magnitude. For example, if we can select 
< Tj < 1 from a uniform distribution, using a 48-bit random number generator, there 
will be no two identical values of rj in a system of any size that we study. In this case 
> 2~^^ and we can select a > 2'^^ In 2 to guarantee the strong disorder limit. To find 
the optimal paths under the ultrametric condition, we start from one node (the origin — see 
Fig. ^ and visit all the other nodes connected to the origin using a burning algorithm. If 
a node at distance io (from the origin) is being visited for the first time, this node will be 
assigned a list Sq of weights TQi, i = 1 ■■■ £q of the links by which we reach that node sorted 
in descending order. Since roj = exp{aroi), we can use a list of random numbers roj instead. 

5'o = {roi,ro2,ro3, ...rozo}, (2) 

with rQj > Toj+i for all j. If we reach a node for a second time by another path of length ii, 
we define for this path a new list Si, 

5'i = {rii,ri2,ri3, . . .ri;J, (3) 

and compare it with 5*0 previously defined for this node. 
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Different sequences can have weights in common because some paths have hnlcs in common 
because of the loops, so it is not enough to identify the sequence by its maximum weight; 
in this case it must also be compared with the second maximum, the third maximum, etc. 
We define Sp < Sq if there exists a value m, 1 < m < min(£p, £q) such that 

Vpj = Tqj for 1 < i < m and 

Tpj < Tqj for j = m, (4) 

or if iq > ip and Vpj = Vqj for all j < ip. If 5*1 < 5*0, we replace Sq by 5*1. The procedure 
continues until all paths have been explored and compared. At this point, 5*0 = 5'opt, 
where S'opt is the sequence of weights for the optimal path of length £opt- A schematic 
representation of this ultrametric algorithm is presented in Fig. ^ This algorithm is slow 
and memory consuming since we have to keep track of a sequence of values and the rank. 
Using this method, we obtain systems of sizes up to 2^^ nodes, typically 10^ realizations of 
disorder. 

D. Bombing Optimization 

This algorithm allows to compute £opt (and other relevant quantitiesVbetween any two 
nodes in strong disorder limit and was introduced by Cieplak et. al. [2j|. Basically the 
algorithm does the following 

1. Sort the edges by descending weight. 

2. If the removal of the highest weight edge will not disconnect A from B - remove it. 

3. Go back to step El until all edges have been processed. 

Since the edge weights are random, so is the ordering. Therefore, in fact, one does not need 
even to select edge weights and "bombing" algorithm can be simplified by removing randomly 
chosen edges one at a time, provided that their removal does not break the connectivity 
between the two nodes. The bottleneck of this algorithm is checking the connectivity after 
each removal. To speed it up, we first compute the minimal path between nodes A and B 
using Dijkstra's algorithm. Then we must check the connectivity only if the removed bond 
belongs to this path. In this case, we attempt to compute a new minimal path between A 
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FIG. 1: In (a) we show schematically a network consisting of five nodes (A, B, C, D, and E). The 
links between them are shown in dashed lines. The origin (A) is marked in gray. All links were 
assigned random weights, shown beside the links. In (b) one node (C) has been visited for the 
first time (marked in black) and assigned the sequence (8) of length i = 1. The path is marked 
by a solid arrow. Notice that there is no other path going from the origin (A) to this node (C) 
so iopt = 1 for that path. In (c) another node (B) is visited for the first time (marked in black) 
and assigned the sequence (10, 8) of length 2. The sequence has the information of all the weights 
of that path arranged in decreasing order. In (d) another node (D) is visited for the first time 
and assigned the sequence (8,7) of length 2. In (e), node (B) visited in (c) with sequence (10,8) 
is visited again with sequence (8,7,6). The last sequence is smaller than the previous sequence 
(10, 8) so that node (B) is reassigned the sequence (8, 7, 6) of length 3 [See Eq. Q)]. The new path 
is shown as a solid line. In (f) a new node (E) is assigned with sequence (8,7,4). In (g) node (B) 
is reached for the third time and reassigned the sequence (8, 7, 4, 3) of length 4. The optimal path 
for this configuration from A to B is denoted by the solid arrows in (g) (After js^). 

and B on the subset of remaining bonds. If our attempt fails, it means that the removal of 
this bond would destroy the connectivity between A and B. Therefore, we restore this bond 
and exclude it from the list of bonds subject to random removal. With this improvement 
we could reach systems of sizes up to 2^^ nodes and 10^ realizations of weight disorder. 
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E. The Minimum Spanning Tree (MST) 



The MST on a weighted graph is a tree that reaches all nodes of the graph and for 
which the sum of the weights of all the links or nodes (total weight) is minimal. Also, 
in the "strong disorder" limit, each path between two sites on the MST is the optimal 



path [1J,J2J| ^meaning that along this path the maximum barrier (weight) is the smallest 
possible j^, 32, [471. Standard algorithms for finding the MST are Prim's algorithm 3| 
which resembles invasion percolation 25 1 and Kruskal's algorithm First we explain the 
Prim's algorithm. 

(a) Create a tree containing a single vertex, chosen arbitrarily from the graph. 

(b) Create a set containing all the edges in the graph. 

(c) Remove from the set an edge with minimum weight that connects a vertex in the tree 
with a vertex not in the tree. 

(d) Add that edge to the tree. 

(e) Repeat steps (c-d) until every edge in the set connects two vertices in the tree. 

Note that two nodes in the tree cannot be connected again by a link, thus forbidding loops 
to be formed. Prim's algorithm essentially starts by choosing a random node in the network, 
and then growing outward to the "cheapest" link which is adjacent to the starting node. 
Each link which is "invaded" is added to the growing cluster (tree), and the process is iterated 
until every site has been reached. Bonds can only be invaded if they do not produce a loop, 
so that the tree structure is maintained [20]. This process resembles invasion percolation 
with trapping studied in the physics literature [llils^- ^ direct consequence of the invasion 
process is that a path between two sites A and B on the MST is the path whose maximum 
weight is minimal, i.e., the minimal-barrier path. This is because if there were another 
path with a smaller barrier (i.e. maximal weight link) connecting A and B, the invasion 
process would have chosen that path to be on the MST instead. The minimal-barrier path 
is important in cases where the "bottleneck" link is important. For example, in streaming 
video broadcast on the Internet, it is important that each link along the path to the client 
will have enough capacity to support the transmission rate, and even one link with not 
enough bandwidth can become a bottleneck and block the transmission. In this case we will 
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choose the minimal-barrier path rather than the optimal path. An equivalent algorithm for 
generating the MST is the Kruskal's algorithm: 

(a) Create a forest F (a set of trees), where each vertex in the graph is a separate tree. 

(b) Create a set S containing all the edges in the graph. 

(c) While S is nonempty: " Remove an edge with minimum weight from S. " If that edge 
connects two different trees, then add it to the forest, combining two trees into a single 
tree. " Otherwise discard that edge. Note that an edge cannot connect a tree to itself, 
thus forbidding loops to be formed. 

Kruskal's algorithm resembles the percolation process because we add links to the forest 
according to increasing order of weights. The forest is actually the set of percolation clusters 
growing as the occupation probability is increasing. It was noted by Dobrin et al. that 
the geometry of the MST depends only on the unique ordering of the links of the network 
according to their weights. It does not matter if the weights are nearly the same or wildly 
different, it is only their ordering that matters. Given a network with weights on the links, 
any transformation which preserves the ordering of the weights (e.g., the link which has 
the fiftieth largest energy is the same before and after the transformation) leaves the MST 
geometry unaltered. This property is termed "universality" of the MST. Thus, given a 
network with weights, represented by a random variable distributed uniformly, a monotonic 
transformation of the weights will leave the MST unchanged. 

Another equivalent algorithm to find the MST is the "bombing optimization algorithm" 
js^. Similar to the one explained in Section 111 Dl we start with the full network and remove 
links in order of descending weights. If the removal of a link disconnects the graph, we 



restore the link 



42| |: otherwise the link is removed. The algorithm ends and the MST is 



obtained when no more links can be removed without disconnecting the graph. 



F. The Incipient Infinite Cluster (IIC) 

rn 

To find the IIC of ER and SF in uncorrelated weighted networks [66], we start with the 
fully connected network and remove links in descending order of their weights. After each 
removal of a link, we calculate the weighted average degree k = {k'^)/{k), which decreases 



14 



with link removals. When k < 2, we stop the process [26[. The meaning of this criterion 
is explained in the next section, where its connection with the percolation threshold Pc is 
established. The largest remaining component is the IIC. For the two dimensional (2D) 
square lattice we cut the links (bonds) in descending order of their weights until we reach 
the per colation threshold Pc (= 0.5). At that point the largest remaining component is the 
IIC |25|. 

III. OPTIMAL PATH IN STRONG DISORDER AND PERCOLATION ON THE 
CAYLEY TREE. 

In this section we review classical analytical methods for exploring random networks 
based on percolation theory on a Cayley tree 25, 7^1; or branching processes |27]. To obtain 



the optimal path in the strong disorder limit, we present the following theoretical argument. 
It has been shown 



22l |2J] that the optimal path for a ^ oo between two nodes A and 
B on the network can be obtained by the bombing algorithm described in Section IIIDl 
This algorithm is based on randomly removing links. Since randomly removing links is a 
percolation process, the optimal path must be on the percolation backbone connecting A and 
B. We can explore the network starting with node A by Dijkstra's algorithm, sequentially 
creating burning shells of chemical distance n from the node A. Alternatively we can think 
of the n-th shell as of n-th generation of descendants of a parent A in a branching process. 
The random network consisting of a large number of nodes N ^ oo and small average degree 
{k) <^ N, has a tree-like local structure with no loops, since the probability that a node 
we randomly chose by an outgoing link has been already visited is less than {k)'^/N, which 
remains negligible for n < InN/ ln{k). 

As we remove links by the bombing algorithm, the average degree of remaining nodes 
decreases, and the role of loops decreases. Thus finite loops play no role in determining the 
properties of the optimal path. In fact, connecting the nodes A and B by an optimal path is 
equivalent to connecting each of them to a very distant shell on a corresponding Cayley tree. 
As the fraction q = 1 — p of remaining links decreases, we reach the percolation threshold 
at which removal of a next link destroys the connectivity with a very high probability. Note 
that if we select weights of the links = exp(arj), where r, is uniformly distributed on [0, 1], 
the fraction of remaining bonds, p, is equal to rj of the next link we will remove. 
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A. Distribution of the maximal weight on the optimal path 



In order to further develop this analogy, we will show that the distribution of the maxi- 
mal random number Tmax along the optimal path js^ can be expressed in terms of the order 
parameter Poo{p) in the percolation problem on the Cayley tree, where -Poo(p) is the proba- 
bility that a randomly chosen site on the Cayley tree has infinite number of generations of 
descendants or, in other words, belongs to the infinite cluster. 

If the original graph has a degree distribution P{k), the probability that we reach a node 
with a degree k by following a randomly chosen link on the graph, is equal to kP{k)/ (k), 
where (k) is the average degree. This is because the probability of reaching a given node by 
following a randomly chosen link is proportional to the number of links, k, of that node and 
(k) comes from normalization. Also, if we arrive at a node with degree k, the total number 
of outgoing branches is A; — 1 . Therefore, from the point of view of the Cayley tree, the 
probability Pk-i to arrive at a node with k — 1 outgoing branches by following a randomly 
chosen link is 



In the asymptotic limit, — * oo, when the optimal path between the two nodes is very long, 
the probability distribution for the maximal weight link can be obtained from the following 
analysis. Let us assume that the probability of not reaching n — th generation starting from 
a randomly chosen link of the Cayley tree whose links exist with a probability p, is Qn- 
Suppose this link leads to a node whose outgoing degree is 2. Then the probability that 
starting from this link, we will not reach n generations of its descendants is the sum of three 
terms: 

1. The probability that both outgoing links do not exist is equal to {1 — p)^ 

2. The probability that both outgoing links exist, but they do not have n — 1 generations 
of descendants is equal to p'^Q^-i 

3. The probability that only one of the two outgoing links exist but it does not have n — 1 
generations of descendants is equal to 2(1 — p)pQn-i 

Therefore, in this case 



Pk-i = kP{k)/{k). 



(5) 



Qn = {l- pf + p'QLi + 2(1 - p)pQ^ 



n—li 



(6) 
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which on simphfication becomes 



(7) 



Following this argument for the case when our link leads to a node with m outgoing links, 
the probability that starting from this node, we can not reach n generations, is 

Qn = {{l-p)+pQn-ir. (8) 

In the case of a Cayley tree with a variable degree, we must incorporate a factor Pk-i given 
by Eq.(0) which accounts for the probability that the node under consideration has k — 1 
outgoing edges and sum up over all possible values of k. Thus for a conducting link on 
the Cayley tree, the probability that it does not have descendants in generation n can be 
obtained by applying a recursion relation 

oo 

= ^ P(A;)fc((l -P)+ pQi-if^'/{k) (9) 

k=l 

for / = 1, 2, n with the initial condition Qq = 0, which indicates that a given link is always 
present in generation zero of its descendants. 

For a random graph, a randomly chosen node has k outgoing edges with the original 
probability P{k). Thus it has a slightly different probabihty Qnip) of not having descendants 
in its nth generation: 

oo 

Qn = Y.nm^-p)+PQn-lf. (10) 
k=0 

It is convenient to introduce the generating function of the original degree distribution 

oo 

G{x) = Y.P(^)^'' (11) 

k=l 

and the generating function of the degree distribution of the Cayley tree 

G{^)^t^-^^'-\ (12) 
k=i 

where x is an arbitrary complex variable. Using the normalization conditions for the 
probabilities I]fcLo-^(^) ~ 1' easy to see that G{1) = 1. Taking into account that 
{k) = Z^fclo kP{k) we have {k) = dG/dx\x=i = G'{1) and hence G{x) and G{x) are connected 
by a relation 

G{x) = G'{x)/G'{l). (13) 
17 



For any degree distribution P{k) — > 0, as — oo and thus both functions are analytic 
functions of x and have a convergence radius R > 1. Since P{k) > 0, these functions and 
all their derivatives are monotonically increasing functions on an interval [0,1). For the 
ER networks, the degree distribution is Poisson given by: P{k) = (A;)^ exp"'!'^^ /k\, hence 
G{x) = G{x) = exp[(A;)(x — 1)]. For scale free distribution, P{k) ~ k^^, hence G{x) is 
proportional to Riemann ^-function, C\{x). 

If we denote by fn{p), the probability that starting at a randomly chosen conducting link 
we can reach, or survive up to, the n-th generation, then 

fn = l- Qn{p) (14) 

and by fn{p), the probability that a randomly chosen node has at least n generation of 
descendants, 

/n = 1 - Qn{p) (15) 

then 

U = 1 - G{1 - pfn-l) (16) 

and 

/„ = l-G(l-p/„_i). (17) 
The sequence of iterations (fT^ is visualized (See Fig.E)) as a process of solving the equation 

x = l-G{l-px) (18) 

by an iteration method. Obviously, this equation has at least one root Xq = 1. But if 
the derivative of the right hand side, [1 — G(l — px)]'\x=o = pG'{l) > 1, we will have 
another root < Xi < 1. This root has a physical meaning of a probability Poo{p) that a 
randomly selected conducting link is connected to infinity (See also [26]). For p > 1/G"(1), 
the iterations will converge to this root, while for p < 1/G"(1), the iterations will converge 
to Poo(p) = 0. Thus 

" G'{1) (P) -{k) - 1 ^ ^ 

has a meaning of the percolation threshold above which there is a finite probability to reach 
the infinity. Using this equation we can derive the condition k < 2 to stop bombing in the 
process of obtaining IIC. Indeed n < 2 indicates that equation (fTBj) has only one trivial 
solution xq = even for p = 1. This means that all the clusters in this network are finite. If 
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FIG. 2: The iterative process of solving equation ()18() . The thin straight hne y = x represents 
the left hand side. The bold curve represents the right hand side (r.h.s) for p = pc, at which the 
r.h.s. is tangential to y = x at the origin. The dashed curve represents r.h.s. for p > p^. Both 
cases are computed for the Poisson degree distribution with {k) = 2, so r.h.s of Eq. H18() is given 
by 1 — exp(— 2px). The arrows represent iterations starting from /o = 1 (the starting link belongs 
to generation 0). It is clear that the convergence of the iterations is very fast (exponential) for 
p / Pc, while it is very slow (power law) for p = pc- 

K, > 2, Pc < 1 accordingly Poo(l) > 0, i.e. the infinite cluster does exist. The condition k = 2 
corresponds to Pc = 1 which means that any further link removal will produce a network 
in which Poo(l) = ,i-e. the network with only finite clusters, while at p = 1, the infinite 
cluster is incipient. 

The probability that a randomly chosen node is connected to infinity can be determined 

as 



where Poo{p) is a non-trivial solution of Eq. ()18p. For some degree distributions including 
Poisson distribution, Poo(l) < 1- This indicates that a randomly chosen node on the original 
network may not belong to the giant component of the network. In fact, the optimal path 
between nodes A and B exists if both belong to the giant component. Provided that A and 
B both belong to the giant component, the probability that they are still connected when 



PM = ^-G{l-pPoo{p)), 



(20) 
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FIG. 3: The probability distribution of the maximal random number rmax along the optimal path 
obtained using simulations on a random graph with (k) = 4 and using the analytical method on a 
Cayley tree with Poisson degree distribution and (k) = 4. The simulations involve 100000 network 
realizations and are carried out on a network of 65536 nodes. The values of /opt for this network 



lie in the range 40 < /opt < 120 (After 



40|). 



the fraction 1 — p of bonds is removed is 

U{p) = ( SiM^ . (21) 

Translating this condition to the bombing algorithm of generating an optimal path, 11 (p) 
is the probability that the maximum random number along the optimal path rmax < P- 
Indeed, n(p) is the probability that when only a fraction p of links remains the connectivity 
between A and B still exists. Hence r^ax ^ P- Thus, n(rinax) is the cumulative distribution 
of ^max- The probability density of r^ax is thus equal to the derivative of n(p) with respect 
to p: 

P(rmax) = ^n(p)|,=,^,,. (22) 

In Fig. 121 we plot two curves. The curve with symbols is the probability distribution of 
Tinax in a strongly disordered ER graph with (k) = A obtained by simulations. The line shows 
the same probability distribution obtained using (Eq. (|121)) for a Poisson degree distribution 
with {k) = 4. The curves coincide very well, indicating the excellent agreement between the 
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theoretical analysis and simulations. 



B. Distribution of the cluster chemical length at percolation threshold 

Figure |21 illustrates the convergence of the probability of the random link to have de- 
scendants in the n-th generations. The difference P{n) = fn — fn+i is the probability that 
the last generation of the descendants of this link is n. In percolation language, it is the 
probability distribution of the cluster chemical length I = n. In order to find, how /„ ^ 
when "p = Pc, we can expand equation in Taylor series at /„ = 0. For ER networks, 
G{x) has all the derivatives at x = 1, thus (fTBj) can be presented as 

/„ = - lp'G"il)fl, + OULi)- (23) 

For SF graphs with A > 4, G"{1) also exists, thus the above equation holds. For 3 < A < 4 
the second derivative does not exist, however using the Tauberian theorem which relates the 
speed of the decay of the coefficients P{k) ~ A;~^ of the power series and the behavior of its 
singularity at the convergence radius: Gs{x) ~ (1 — xY~^ we can write: 

fn = pG'{l)fn-l - Cft-l + OUtl), (24) 

where c is some positive coefficient. 

As A — > 3, G"(l) oo and hence, according to Eq. p9j) pc ^ 0. This means that for 
SF networks with A < 3, percolation approach breaks down. However, for finite networks, 
it is unlikely to have a degree larger than A^i/('^^i). This fact is obvious since when one 
generates random degrees with probability distribution P{k), one produces random numbers 
X uniformly distributed on a interval between and 1, and compute k = f{x), where 
f{x) satisfies the equation x = J2J{x)P{k) ~ f{x)^^^^. Thus the largest k corresponds to 
the smallest x. Generating random numbers is equivalent to throwing points on an 
interval [0, 1] which divide this interval into A^ + 1 segments whose lengths are identically 
distributed with an exponential distribution. Thus the average value of the smallest x is 



equal to 1/(A^+ 1). Accordingly 

fcmax = /(l/(A^+l))~ArV(A^l) 



:he average value of the largest k can be approximated as 
Thus replacing summation by integration up to fc-may 



in the expression for G"(l) ^ J^'^^'' k'^+^dk ~ k^J"^ = Ar(3-A)/(A-i). g^^^g for 2 < A < 3 [2^ 

p^^iV(A-3)/(A^i). (25) 
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When p < pc, fn ~ (p/Pc)"', i-e. the convergence is exponential When p = pc, we will 
seek the solution of the above recursion relations in a power law form: /„ ~ n~^. Expanding 
them in powers of n~^, and equating the leading powers, we have 9 + 1 = 9{X — 2), from 
which we obtain 

fn-n-y('-'\ (26) 



or 



where 



P(£) = ~ (27) 



2, A > 4 ER 
r,= ' . (28) 

(A^ + l> 3<A<4 

The probability that a randomly selected node has exactly i generations of descendants is 
equal to 

P{e) = h- fi+i = G{1 - pfe) - G{1 - pfe-,) ~ {k)p{fe - fe-i). (29) 

Thus it is characterized by the same as P{i). 

Taylor expansions (PHj) and can be used to derive the behavior of Poo (p) asp ^ pc by 
letting fn = fn-i = Pooijp) and solving the resulting equations with a leading term accuracy: 

Poo{p) = {p-Pcf, (30) 

where Q 

' 1, A > 4 ER 
/3= ' . (31) 

A- 3, 3<A<4 

C. Distribution of the cluster sizes at percolation threshold 



Using the generating functions [a |46|, |53[ , one can also find the distribution of the clusters 
sizes, P(s), connected to a randomly selected link. For simplicity, let us again consider a 
link (conducting with probability p) leading to a node of a degree = 3, so it has only two 
outgoing links. The probability that this link is connected to a cluster consisting of s nodes 
obeys the following relations 

P{s)=p mP{l) (32) 

k+l=s-l 
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for s > and -P(O) = 1 — p. Introducing the generating function of the cluster size distribu- 
tion H{x) = X]o° P{s)x'^, we have: H{x) = 1 — p + xpH'^{x). In a general Cayley tree with 
an arbitrary degree distribution we have: 

H{x) = l- p + xpG{H{x)). (33) 

This equation defines the behavior of H{x) for x 1, and thus via the Tauberian theorem 
defines the asymptotic behavior of its coefficients P{s). Note that H{1) is the cumulative 
probability of all finite clusters. Thus {1—H{1)) = pPoo{p) is the probability that a randomly 
selected link conducting with probability p is connected to the infinity and Eq. (j33|) becomes 
equivalent to Eq. (fTS|) for P^oip)- 

Introducing 5x = I — x and 6h = 1 — H{x) and expanding G{x) around x = 1 at 
percolation threshold p = we have 5h5x + p5x = cxS^^ + 0{5^^) which yields 

Sh ~ ^l^^'^^'^^- Using the Tauberian theorem we conclude 

P{s) ~ (34) 

where 

f 3/2, A > 4 ER 
rs = { ' ' . (35) 

^ + 3<A<4 

Analogous considerations suggest that the probabilities P{s) that a randomly selected node 
belongs to the cluster of size s produce the generating function H{x) = G{H{x)). Since for 
A > 3, G"{1) < oo, the singularity of H{x) for x ^ 1 is of the same order as the singularity 
of H{x) and thus its coefficients, P{s), also decay as s""^'. 

Following [t^], we will show that the distribution of all the disconnected clusters in a 
network scales as Paii^s) = P{s)/s ~ 5^"^^+^. Indeed, let us select a random node in this 
network. The number of nodes belonging to the clusters of size s is NsPau{s)/ J2T ^Paa{s) = 

NsPall{s)/{s). Thus, P{S) = sPau{s)/{s). 

If we have a network of nodes, the size of the largest cluster S is determined by the 
relation J^'^s Paii{s) ~ ^/N, which becomes clear if we describe a concrete realization of the 
cluster sizes by throwing N/ (s) random points representing clusters under the curve Pau{s). 
The average area corresponding to each of these points is and the area corresponding to 
the rightmost point representing the largest cluster is Z^^s -Pa//(s) ~ S^'^\ Thus the largest 
cluster (which coincides with IIC) in the network of nodes scales as 

S ^ N^/^\ (36) 
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For ER graphs, the relation S ~ N'^/^ has been derived in a classical work 



0. 



IV. SCALING OF THE LENGTH OF THE OPTIMAL PATH IN STRONG DIS- 
ORDER 

The relations obtained in the previous subsections allow us to determine the scaling 
of the average optimal path length in a network of nodes. When during bombing, we 
reach percolation threshold, we have targeted only a tiny fraction of links (or nodes) on the 
optimal path, with Vmax > Pc which we have to restore, because their removal would destroy 
the connectivity. The majority of the links on the optimal path remains intact. All of them 
belong to the remaining percolation clusters which at percolation threshold has a tree like 
structure with no loops. At this point, the optimal path coincides with the shortest path, 
which is uniquely determined. We will describe this situation in detail in Sectior fVll With 
high probability, the optimal path between any two nodes A and B goes through the largest 
cluster at the percolation threshold. Thus its length must scale as the chemical length of 

n 

the largest percolation cluster Assuming a power law relation between the cluster size 
s and its chemical dimension i, s = f*^^, and using the fact that both of the quantities have 



power law distributions P{£)di = P{s)ds, we have = i-de-rs+de-i _ rjj^^g 

de = (r, - l)/(r, - 1). (37) 
Therefore, S ~ Ig^f and using we have £opt ~ S^^'^'^ ~ JSf^opt ^ where 

Vopt = l/{diT,). (38) 
Using Eqs. and for and respectively, we have 



^opt 



1/3, A > 4, ER 
(A -3)/(A - 1), 3 < A < 4 



(39) 



Note that A = 4 corresponds to the special case when G"{1) diverges, in this case the 
Tauberian theorem predicts logarithmic corrections, and hence we expect £opt ~ N^/^ /\nN 
for A = 4. 

We review above the exact results for the Cayley tree, from which using heuristic argu- 
ments we have derived the scaling relation between the average length of the optimal path 



24 




FIG. 4: (a) Plot of 

^opt as a function of in double logarithmic scale for the optimal path length 
in strong disorder using the two numerical methods discussed in the text: (i) results obtained using 
the "bombing" approach (o) and (ii) results obtained using the ultrametric approach (x). The 
dashed line shows the slope 1/3. (b) Successive slopes fopt(-^) as a function of l/N'^/'^ for the 
optimal path length in strong disorder using the two methods described in the text. The symbols 
denote the same as in (a). The dashed line is the quadratic fitting of the results showing that 
the extrapolated value of the effective exponent in the limit N ^ oo app roaches 1/3. This result 



coincides with our theoretical value Vopt = 1/3 asymptotically (After 



and the number of nodes in the network. Now we will show how the same predictions can 
be obtained using general percolation theory. We will also present numerical data support- 
ing our heuristic arguments. We begin by considering the ER graph. At criticality, it is 
equivalent to percolation on the Cayley tree or percolation at the upper critical dimension 
dc = Q- For the ER graph, we derived above that the mass ot the lie. S, =cate as iVV' 0. 
This result can also be obtained in the framework of percolation theory for dc = 6. Since 
S ~ -R'^^ and N (where df is the fractal dimension and R the spatial diameter of the 

cluster), it follows that S ~ N'^f^'^ and for d^ = 6, df = 4 Q we obtain S ~ A^^/s 3^. 

It is also known 2^ that, at criticality, at the upper critical dimension, the average 
shortest path length fmin ~ R^, hke a random walk and therefore S ~ ^^j^ with de = 2. 
Thus 

Cin ~ Lpt ~ S^/'^' ~ N^/^'^' ~ AT^opt , (40) 
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where i^opt = 2/3^^ = 1/3. 



For SF networks, we can also use the percolation results at criticality. It was found 
(see Sec. EH) that rf^ = 2 for A > 4, de = {X - 2)/(A - 3) for 3 < A < 4, ^ ~ iV^/s ^ > A, 
and S ~ A^(^-2)/(A-i) for 3 < A < 4. Hence, we conclude that 

A>4 

1 iV(A-3)/(A-i) 3<A<4 
Thus Uopt = 1/3 for ER and SF with A > 4, and u^pt = (A-3)/(A-l) for SF with 3 < A < 4. 
Since for SF networks with A > 4 the scaling behavior of £opt is the same as for ER graphs 
and for A < 4 the scaling is different, we can regard SF networks as a generalization of ER 
graphs. 

Next we describe the details of the numerical simulations and show that the results agree 
with the above theoretical predictions. We perform numerical simulations in the strong 
disorder limit by the method described in Section III Dl for ER and SF networks. We also 
perform additional simulations for the case of strong disorder on ER networks using the 
ultrametric optimization algorithm (see Section fll C|) and find results identical to the results 
obtained by randomly removing links. In Fig. Efa) we show a double logarithmic plot of 
-t-opt ^s a function of for ER graphs. To evaluate the asymptotic value for z/opt we use 
for both approaches successive slopes, defined as the successive slopes [2^ of the values on 
Fig.|3J One can see from Fig. |3fb) that their value approaches 1/3 when ^ 1, supporting 
Eq. PH). 

The theoretical considerations [Eqs. pUj) and ()41|)] predict that SF graphs with A > 4, 
are similar to ER with £opt ~ N^^^, while for SF graphs with 3 < A < 4, iopt ~ A^(^~3)/(a-i)_ 
Figure shows data from numerical simulations supporting the linear behavior of £opt 
versus N^^^ for A > 4. The quality of the linear fit becomes poor for A ^ 4. At this 
value, there are corrections probably due to logarithmic divergence of the second moment 
of the degree distribution, i.e., iopt ~ N^^^/\nN (see Fig. Eb)- Figure Efc shows results of 
simulations supporting the asymptotic linear behavior of iopt versus A^('^^'^)/('^^i) for 3 < A < 
4. Theoretically, as A — >• 3, i^opt = (-^ ~ 3)/(A — 1) — 0, and thus one can expect for A = 3 
a logarithmic A^ dependence of £opt- Indeed, for 2 < A < 3 our numerical results for the 
strong disorder limit suggest that £opt scales slower than a power law with A^ but slightly 
faster than In A^. The numerical results can be fit to £opt ~ (In A^)'^~^ (see Fig. EJi). Note 
that the correct asymptotic behavior may be different and this result may represents only a 
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FIG. 5: Results of numerical simulations, (a) The dependence of ^opt on N^^'^ for A > 4. (b) 
The dependence of iopt/ In on N^^^ for A = 4. (c) The dependence of £opt on A^('^^3)/(A-i) £qj. 
3 < A < 4. (d) The dependence of (opt on In for A < 3 (After Q, Q)- 

crossover regime. The exact nature of the percolation cluster at A < 3 is not clear yet, since 
in this regime the transition does not occur at a finite (non zero) critical threshold . We 
obtain similar results for SF networks where the weights are associated with nodes instead 
of links. 



V. SCALING OF THE LENGTH OF THE OPTIMAL PATH IN WEAK DISOR- 
DER 

When a = 1/kT — > 0, all the Tj essentially contribute to the total cost. Thus T —>■ (yo 
(very high temperatures) corresponds to weak disorder limit. We expect that the optimal 
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path length in the weak disorder case will not be considerably different from the shortest 
path, as found also for regular lattices j3l and random graphs Thus we expect that 
the scaling for the shortest path will also be valid for the optimal path in weak disorder, but 
with a different prefactor depending on the details of the graph and on the type of disorder. 
We simulate weak disorder by selecting < < 1 from a uniform distribution. To compute 



^opt we use the Dijkstra algorithm (See Section |TTB} |.l 3] . The scaling of the length of the 
optimal path in WD for ER, is shown in Fig. IHfa). Here we plot £opt as a function of In 
for {k) = 4. The weak disorder does not change the scaling behavior of iopt on ER compared 



to 



only the prefactor. 




LnN 
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FIG. 6: Results of numerical simulations, (a) The linear dependence of iopt on In N for ER graphs 
in the weak disorder case for < k >= 4. The dashed line is used as a guide to show the linear 
dependence, (b) The dependence of ^opt on In N for SF graphs in the weak disorder case for various 
values of A. The different curves represent different values of A from 2.5 (bottom) to 5 (top) (After 
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For SF networks, the behavior of the optimal path in the weak disorder limit is shown 
in Fig. Mjo) for different degree distribution exponents A. Here we plot £opt as a function 
of InA^. All the curves seem to have linear asymptotes. This result is analogous to the 
behavior of the shortest path £min ~ InA^ for 3 < A < 4 and ER. Note, however, that for 
2 < A < 3, £min scale as InlnA^ [7]. Thus, iopt is significantly larger and scales as InA^ 
(Fig. 3b). Thus, weak disorder does not change the universality class of the length of the 
optimal path except in the case of "ultra-small" worlds 2 < A < 3, where £opt ~ ^wi^min), 
and the networks become small worlds. 
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VI. CROSSOVER FROM WEAK TO STRONG DISORDER 



A. Exponential Disorder 



Consider the case of finite a (T > 0). In this case we expect a crossover in the length 
of the optimal path (or the system size N) from strong disorder behavior to weak disorder 
depending on the value of a. In order to study this crossover we have to use an implemen- 
tation of disorder that can be tuned to realize narrow distributions of link weights (WD) as 



well as broad distributions of lin 
the disorder is as follows 
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c weights (SD). The procedure that we adopt to implement 



3G 



22] (See Sec. IIVI A). Assign to each link i of the network 



a random number r^, uniformly distributed between and 1. For the analogy with the ther- 
mally activated process described in Sec. IIVI the rj play the role of the energy barriers. The 
transit time or cost associated with link i is then Tj = exp(arj), where a controls the strength 
of disorder i.e., the broadness of the distribution of link weights. The limit a — oo is the 
strong disorder limit, where a single link dominates the cost of the path. For (i-dimensional 



lattices of size L, the crossover is found 



301 1 to behave as 



L^-SL«a- ^^^^ 



where z/ is the percolation correlation exponent 0,13- -^^^ d = 2, dopt ~ 1-22 and for d 
dopt ~ 1.44 124 



30|. Here we show [43] that for any network of size N and any finite a, there 
exists a crossover network size A^* (a) such that for <^ A^* (a) the scaling properties of the 
optimal path are in the strong disorder regime, while for ^ N*{a) the typical optimal 
paths are in the weak disorder regime. We evaluate below the function N*{a). 

In general, the average optimal path length ioptid) in a weighted network depends on a 
as well as on A^. In the following we use instead of A^ the min-max path length i^o which is 
related to A^ as £oo = ^opt(oo) ~ N''°p^ [Eqs.fjin| and (jH}] and hence A^ can be expressed in 
terms of i^o, 

N ~ (43) 

Thus, for finite a, £opt(a) depends on both a and ioo- We expect a crossover length i*{a), 
which corresponds to the crossover network size N*{a), such that (i) for i^o <^ ^*(o), the 
scaling properties of iopt{a) are of the strong disorder regime, and (ii) for i^o ^ the 
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scaling properties of £opt(o) are of the weak disorder regime. In Fig. [71 we show a schematic 
representation of the changes of the optimal path as the network size increases. 




N<N*(a) 
Strong Disorder 



N = N*(a) 



N > N*(a) 
Weak Disorder 



FIG. 7: Schematic representation of the transition in the topology of the optimal path with system 
size for a given disorder strength a. The solid line shows the optimal path at a finite value of a 
connecting two nodes indicated by the filled circles. The portion of the min-max path that is distinct 
from the optimal path is indicated by the dashed line, (a) For N <^ N*{a) (i.e. ioo ^ 
the optimal path coincides with the min-max path, and we expect the statistics of the SD limit. 

(b) For N = N*(a) (i.e. £00 = the optimal path starts deviating from the min-max path. 

(c) For N ^ N*(a) (i.e. £00 ^ ^*(^))) the optimal path has almost no links in common with the 



min-max path, and we expect the statistics of the WD limit (After 



In order to study the transition from strong to weak disorder, we introduce a measure 
which indicates how close or far the disordered network is from the limit of strong disorder. 
A natural measure is the ratio 

4pt(a) 



W{a) 



(44) 



Using the scaling relationships between £opt(fl) and in both regimes, and £00 ~ N'^°p^, we 
get 



£00 ~ N''°^' [SD] 
\n£^^\nN [WD]. 



From Eq. (jlH) and Eq. (gS)) it follows. 



W{a) 



const. [SD] 



ln£oo/^oo [WD]. 



We propose the following scaling Ansatz for W{a) 

W{a) = f{4^ 



(45) 



(46) 



(47) 
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where 



with 



F(u) 



const. u <^ 1 
\n{u)/u l' 



u 



e*{a)' 



(48) 



(49) 



We now develop analytic arguments ^47] to obtain the dependence of the crossover length 
i* on the disorder strength a. These arguments will also give a clearer picture about the 
nature of the transition of the optimal path with disorder strength. 

We begin by making few observations about the min-max path. In Fig. |H1 we plot the 
average value of the random numbers r„ on the min-max path as a function of their rank 
n {1 < n < ioo) for ER networks with (k) = 4 and for SF networks with A = 3.5. This 
can be done for a min-max path of any length but in order to get good statistics we use the 
most probable min-max path length. We call links with r < pc "black" links, and links with 
r > Pc "gray" links, following the terminology of loselevich and Lyubshin [42,] where pc is 
the percolation threshold of the network 
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FIG. 8: Dependence on rank n of the average values of the random numbers along the most 
probable optimal path for (a) ER random networks of two different sizes = 4096 (□) and 



N = 16384 (o) and, (b) SF random networks (After 



We make the following observations regarding the min-max path: 

(i) For r„ < pc, the values of r„ decrease linearly with rank n, implying that the values 
of r for black links are uniformly distributed between and pc, consistent with the 
results of Ref. [43 1. This is shown in Fig. |H1 

(ii) The average number of black links, (ib), along the min-max path increases linearly 
with the average path length £00 • This is shown in Fig. 



31 



(iii) The average number of gray links {ig) along the min-max path increases logarithmically 
with the average path length ^oo or, equivalently, with the network size A^. This is 
shown in Fig. ^jp. 
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FIG. 9: (a) The average number of links (^h) with random number values r < pc on the min-max 
path plotted as a function of its length l^o for an ER network, showing that {ih) grows linearly 
with loo- (b) The average number of links {^g) with random number values r > pc on the min-max 
path versus InA^ for an ER network, showing that {ig) ~ InA^. The inset shows the successive 



slopes, indicating that in the asymptotic limit 
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The simulation results presented in Fig. El are for ER networks; however, we have confirmed 

that the observations (ii) and (iii) are also valid for SF networks with A > 3 HQ- 
Next we discuss our observations using the concept of the MST. The path on the MST 

between any two nodes A and is the optimal path between the nodes in the strong 

disorder limit — i.e, the min-max path. 

In order to construct the MST we use the bombing algorithm (See Section ^iD)- At the 

point that one cannot remove more links without disconnecting the graph, the number of 

remaining black links is 

iV.^^. (50) 

The black links give rise to Nc disconnected clusters. One of these is a spanning cluster, 
called the giant component oi IIC (see Section |n] F). The clusters are linked together 
into a connected tree by exactly A^^^ — 1 gray links (see Fig. ITIH) . Each of the N^. clusters is 
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itself a tree, since a random graph can be regarded as an infinite dimensional system, and 
at the percolation threshold in an infinite dimensional system the clusters can be regarded 
as trees. Thus the N^. clusters containing A*";, black links, together with N^. — '\- gray links 
form a spanning tree consisting of A^";, + A^^^ — 1 links. 





FIG. 10: Schematic representation of the structure of the minimal spanning tree, at the percolation 
threshold, with G being the giant component. Inside each cluster, the nodes are connected by black 
links to form a tree. The dotted lines represent the gray links which connect the finite clusters to 
: brm the gray tree. In this example Nc = ^ and the number of gray links equals A'c — 1 = 3 (After 



Thus the MST provides all min-max path between any two sites on the graph. Since the 
MST connects all nodes, the number of links on this tree must be A^ — 1, so 



N, + N, = N. 



(52) 



From Eq. ^ and Eq. ^ it follows that 

Nr = N\l 



{k)Pc 



(53) 



Therefore Nc is proportional to A^. 

A path between two nodes on the MST consists of ib black links. Since the black links 
are the links that remain after removing all links with r > Pc, the random number values r 
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on the black links are uniformly distributed between and pc in agreement with observation 
(i) and Ref. 3- 

Since there are A^^^ clusters which include clusters of nodes connected by black links as 
well as isolated nodes, the MST can be described as an effective tree of A^^^ "super" nodes, 
each representing a cluster, and A^^^ — 1 gray links. We call this tree the "gray tree" (see 
Fig. I10|). This tree is in ta. a .a,e free .ee Q de,ee exponent A, ^ .5 to/ER 
networks and for scale for networks with A > 4, and = (2A — 3)/(A — 2) for SF networks 
with 3 < A < 4. If we take two nodes A and B on the original network, they will most likely 
lie on two distinct effective nodes of the gray tree. The number of gray links encountered 
on the min-max path connecting these two nodes will therefore equal the number of links 
separating the effective nodes on the gray tree. Hence the average number of gray links 
(ig) encountered on the min-max path between an arbitrary pair of nodes on the network is 
simply the average diameter of the gray tree. Our simulation results (see Fig. Eb) indicate 
that 

{eg)^\nN. (54) 

Since (ig) ~ In^oo C-oo-, the average number of black links on the min-max path 
scales as £oo in the limit of large ^oo in agreement with observation (2) as shown in Fig. 

Next we discuss the implications of our findings for the crossover from strong to weak 
disorder. From observations (i) and (ii), it follows that for the portion of the path belonging 
to the giant component, the distribution of random values r is uniform. Hence we can 
approximate the sum of weights by [l^, 

exp(arfc) ^ — / expar dr = (exp(apc) — 1) = exp(ar*), (55) 

^1 Pc Jo apc 

where r* pc + {1/ a) ln((£f,) /apc)- Since ^ £oo, 



p^ + lln(121). (56) 



a \apc 



Thus restoring a short-cut link between two nodes on the optimal path with pc < r < r* 
may drastically reduce the length of the optimal path. When apc ^ ^oo, < Pc and such 
a link does not exists, if ioo > apc, the probability that such a link exist becomes positive. 
Hence when the min-max path is of length i^o ~ clPc, the optimal path starts deviating from 
the min-max path. The length of the min-max path at which the deviation first occurs is 
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precisely the crossover length i*{a), and therefore i*{a) ~ apc- In the case of a network with 
an arbitrary degree distribution we can write using Eq. (j^D), t{a) ~ a 



(k) 



Note that in the case of SF networks, as A ^ 3+, pc approaches zero and consequently 
i*{a) 0. This suggests that for any finite value of disorder strength a, a SF network with 
A < 3 is in the weak disorder regime. We perform numerical simulations and show that the 
results agree with our theoretical predictions. For the details of our simulation methods see 
Section im 

From our theoretical arguments, £*{a) ~ a and therefore, from Eq. ()47|) . W{a) must be a 
function of ioo/d- In Fig. ^2 we show the ratio W{a) for different values of a plotted against 
£oo/^*(a.) = ioo/d for ER networks with (k) = 4 and for SF networks with A = 3.5. The 
excellent data collapse is consistent with the scaling relations Eq. ()47j) . Fig. fT^ shows the 
scaled quantities W{a)u = (!-opt{a) / 1 {a) vs. Inw = ln(£oo/^*(a')) = ln(£oo/'3), for both ER 
networks with {k) = 4 and for SF networks with A = 3.5. The curves are linear at large 
u = ioo/£*{0'), supporting the validity of the logarithmic term in Eq. (j^8|l for large u. 
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FIG. 11: Test of Eqs. and (gHI). (a) W{a) plotted as a function of ioo/a for different values 
of a for ER networks with (k) = 4. The different symbols represent different a values: a = 8(o), 
a = 16(n), a = 22(o), a = 32(A), a = 45(+), and a = 64(*). (b) Same for SF networks with 
A = 3.5. The symbols correspond to the same values of disorder as in (a). The insets show W{a) 
plotted against logf/ fy/a), a nd indicate for ioo <^ a, W{a) approaches a constant in agreement 



with Eq. (gHI) (After 



To summarize, for both ER random networks and SF networks we obtain a scaling func- 
tion for the crossover from weak disorder characteristics to strong disorder characteristics. 
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FIG. 12: (a) Plot of W{a)u = iopt{a)/t{a) = vs Inu = hi{£^/t{a)) = \n{l^/a) for ER 

networks with (A;) = 4 for different values of a. (b) Plot of W{a)u = ^opt (a) /^* (o) = ^opt(a)/a vs 



Inn = ln(^oo/^*(a)) = ln(^oo/a) for SF networks with A = 3.5. 



symbols in (a) and (b) are the same as in Fig. Q (After 
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4^) 



he values of a represented by the 



We show that the crossover occurs when the min-max path reaches a crossover length (!,*{a) 
and I* {a) ~ a. Equivalently, the crossover occurs when the network size reaches a 
crossover size N*ia), where N*ia] ~ a? for ER networks and for SF networks with A > 4 



and N*ia] ~ a ^-3 for SF networks with 3 < A < 4. 



B. General Disorder: Criterion for SD, WD crossovers 

Until now we considered a specific form of -P(t) = P(r, a) = l/{aT) with 1 < r < e". 
The question is what happens for other distributions of weights and what is the general 
criterion to determine which form of Pir) can lead to strong disorder, and what is the 

n 

general condition for strong or weak disorder crossover. We present analytical results [73|| 
for such a criterion which are supported by extensive simulations. Using this criterion we 
show that certain power law distributions and lognormal distributions, P(t, a), where a is a 
parameter determining the broadness of the distribution, can lead to strong disorder and to 



a weak-strong disorder crossover [30|, |32, |47| . We also show that for P(r, a) uniform, Poisson 
or Gaussian, only weak disorder occurs regardless of the broadness of P(r, a). Importantly, 
we find that for all P('r, a) that possess a strong- weak disorder crossover, the distributions 
of the optimal path lengths display the same universal behavior. 



36 



If we express r in terms of a random variable r uniformly distributed in [0, 1), we can use 
the same gray and black link formalism as in the previous section. This can be achieved by 
defining r(r) by a relation: 

r(r) = r P{T',a)dT'. (57) 



Solving this equation with respect to r gives us T(r, a) — f{r, a), where /(r, a) satisfies the 
relation 

ff{r,a) 

r= / P{T',a)dT'. (58) 
Jo 

For a strong disorder regime, the sum of the weights of the black links on the IIC must be 
smaller than the smallest weight of the removed link Tc = f(pc,a): 

J2^i^J2f(^i^^) <^c, (59) 
i=l i=l 

where are independent random variables uniformly distributed on [0,Pc]- As we shown 
above, ib £oo so in the following we will replace it, by the average path length in the 
strong disorder limit, i^o- The transition to weak disorder begins when the probability that 
this sum is greater than Tc becomes substantial. The investigation of this condition belongs 
to the realm of pure mathematics and can be answered explicitly for any functional form 
/(r, a). This condition is satisfied when the mathematical expectation of the sum is greater 



than Tr. 



c- 

, Pc 

--00 ' 



^ f(r,a)dr>r,. (60) 

Pc Jo 

Thus the crossover to weak disorder happens if 

. , _ fiPc,a)Pc _ r,p, 

°° IS' fir, a)dr rP(r, a)dr ' ^"'^ 

where A plays the role of the disorder strength and Tc satisfies the equation Pc — 
Jq" P{T,a)dT. In order for the strong disorder to exist for any network size A^, the disorder 
strength must diverge together with the parameter a of the weight distribution a — > oo. In 
order to determine if a network exhibits a strong disorder behavior it is useful to introduce 
a scaling variable 

Z = £00/ A, (62) 

so that if Z ^ 1 the network is in the weak disorder regime and if Z <^ 1, the network is in 
the strong disorder regime. 
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Note that if /'// > Aq on the entire interval [0, 1], then A > AqPc. Thus another sufficient 
condition for a strong disorder to exist is 



/'// > ^oo/Pc 



(63) 



For the exponential disorder function r = exp(ar), we have /'// = and thus Eq. ()63p 
coincides with the condition of strong disorder apc > ioo derived in the previous Section. 

In the following, we will show how the above condition is related to the strong to weak 
™.ove. co„di«on fo. optical pa.. o„ .a.ice. fly Q. Fo. .he op.i.al pa.h . 
strong disorder limit connecting the opposite sides of the lattice of linear size L, the largest 
random number ri follows a distribution characterized by a width which scales as L~^^'^, 



25 



50 
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. The transition to 



where u is the percolation connectivity length exponent 
weak disorder starts when the optimal path may prefer to go through a slightly larger value 
r2, taken from the same distribution and thus r2 — ri ~ pcL^^^'^. The condition for this to 
happen is [/(r2) — < 1, which is equivalent to 



/'// < L'/yp, 



(64) 



Now we will show that this condition is equivalent to (j63p . Percolation on Erdos-Renyi 
(ER) networks is equivalent to percolation on a lattice at the upper critical dimension 



dr = 6 
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46|. For = 6, L ~ N'^/^, and i/ = 1/2. Thus indeed L^^^ ~ N^/i^c^) ~ (3^. 
Following similar arguments for a scale-free network with degree distribution P(k) ~ A;^^ 
and 3 < A < 4, we can replace L'^/" by A^-(^"3)/(a-i) gj^^^ ^ 2(A-l)/(A-3) [46]. Thus, 
due to Eq. (jSHl) L^^'" ~ ^00 and we can introduce the analogous scaling parameter Z for 
lattices: 

Z = At^. (65) 



Pcf'/f 

Next we calculate A for several specific weight distributions -P(t) [73[. We begin with the 
well-studied exponential disorder function f{x) = e"'', where r is a random number between 
and 1 Q, Q- From Eq. (jSHl) follows that P(r, a) = l/(ar), where r G [l,e"]. Using 
Eq. ()6H1 we have 



A = apcTc/{Tc - 1) ~ apc, 



(66) 



For fixed A, but different a and Pc-, we expect to obtain the same optimal path behavior 



Indeed, this has been shown to be valid 
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68, 
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Next we study A for the disorder function f{r,a) = r"", with r between and 1 where 

If ' 

a > 69]. For this case the disorder distribution is a power law P(r, a) = a~^r^/'*^^. 
Following Eq. (|HT|) we obtain 

A = a + 1 ~ a. (67) 

Note that here a plays a similar role as a in Eq. but now A is independent of pc, which 
means that networks with different pc, such as ER networks with different average degree 
< k >= 1/pc, yield the same optimal path behavior. 

For the power law distribution with negative exponent /(r) = (1 — r)^"" (a > 0), we have 
P(T,a) = a-W-^-'^/" and 



[i-p,y--i i-pc' ^ ^ 



We further generalize the power law distribution with the disorder function f{r,a) = 
by introducing the parameter < A < 1 which is defined as the lower bound of the uniformly 
distributed random number r, i.e., 1 — A < r < 1 \q^. Under this condition, the distribution 
becomes 

P(r,a) = ^ rG[(l-A)M]. (69) 



Again using Eq. (pT|) . we obtain 



PcA + l-A ^ ^ 



Table H] shows the results of similar analysis for the lognormal, Gaussian, uniform and 
exponential distributions P{t, a). From Table |1J we see that for exponential function, power 
law and lognormal distributions, A is proportional to a and thus can become large. However 
for uniform, Gaussian and exponential distributions, A is limited to a value of order 1, so 
Z ^ 1 for large and the optimal path is always in the weak disorder regime. Note that 
for these distributions, A is independent of a. 

In general, one can prove that A ^ oo for a given distribution P(r, a) as a — oo if there 
exist a normalization function c(a) and a cutoff function r(a) such that for Ve > 0, VE > 0, 
3M > such that for a > M and r G [T(a), r(a)E], \tP{t, a)c{a) — 1\ < e. We will call such 
functions P(r, a) "quasi-l/r" distributions because they behave as 1/r in a wide range of r. 
Obviously that exponential, power-law and lognormal distributions are quasi-l/r functions, 
so for them, for large enough a we can observe a strong disorder. 
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TABLE I: Parameters controlling the optimal paths on networks for various distributions of disorder 
(After Q), 



Name 


Function 


Distribution 


Domain 


.4 


Inverse 




j_ 

ar 


r G [l,e1 


apc 


Power Law 




a 


T e (0, 1] 


a 


Power Law 


{l-xf 


1^1+1/a 
a 


r G [1,00] 


l-pc 


Power Law 




T-l/«-l 

|a|A 


r G [(l-Af,!] 


Ol-(l-p,)A 


Lognormal 


gV2acrf-i(2a;-l) 


g-(lnT)2/2a2 

ra\/27r 


r G (0,cx)) 


V^Pc 

"'e-M-^(2pc-i)l2 


Uniform 


ax 


1/a 


r G [0, a] 


2 


Gaussian 


^/2aerf-i(x) 


2e-r2/(2a2) 


T G [0,00) 


•\/27rpcCrf 

l_g-[erf-i{pc)]2 


Exponential 


— aln(l — x) 


a 


T G [0, 00) 


-Pcln(l-pc) 
Pc+{l-Pc)ln(l-pc) 



To test the validity of our theory, we perform simulations of optimal paths in 2d square 
lattices and ER networks. Random weights from different disorder functions were assigned 
to the bonds. For an Lx L square lattice, we calculate the average length iopt of the optimal 
path from one lattice edge to the opposite. For an ER network of nodes, we calculate iopt 
between two randomly selected nodes. 

Simulations for optimal paths on ER networks are shown in Fig. El Here we use the 
bombing algorithm (See Section|n]D) to determine the path length £00 in the strong disorder 
hmit, which is related to A^ by £00 ~ N"""' = N^^^ [2^ (See Section |TX|). We see that for 
all disorder distributions studied, iopt scales in the same universal way with Z = ioo/A. For 
Z ^ 1, iopt/ A is linear with log(£ooM) expected (Fig. 3a). For small Z = ioo/A (Fig. [THl 
b), iopt oc £00 ~ A^"*^^^, which is the strong disorder behavior Thus, we see that when A^ 
increases, a crossover from strong to weak disorder occurs in the scaled optimal paths iopt/ A 
vs. Z. Again, the collapse of all curves for different disorder distributions for ER networks 
supports the general condition of Eq. (jU^ . 

Next we use Eq. ()62|) to analyze the other types of disorder given in Table H] that do 
not have strong disorder behavior. For a uniform distribution, P{r) = 1/a and we obtain 
A = 1. The parameter a cancels, so Z = L^^'^ for lattices, and Z = N'^^^ for ER networks. 
Hence for any value of a, Z << 1, and strong disorder behavior cannot occur for a uniform 
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FIG. 13: The function d-opt/^ for ER networks after scaling, where (a) is a hnear-log plot and {h) 
is a log-log plot. Distributions used are power law x'^ with 10 < a < 30 where < x < 1, 
with a = 30 and the range of A < x < 1 with A = 0.6 or 0.8, and lognormal distribution with 
10 < a < 30. The straight line in (a) indicates weak disorder and the straight line in (6) indicates 



strong disorder (After 



distribution. 




FIG. 14: The optimal path for Gaussian distribution of weights for ER networks. Note that these 
curves would collapse after scaling to the curves in Fig. ^Jin the weak disorder tail of large Z^^ 
(After Q). 



Next we analyze the Gaussian distribution. We assume that all the weights Tj are positive 
and thus we consider only the positive regime of the distribution. Using Eq. (pT|) we obtain 



_ V2^Pcerf-^ 

1 _p-[erf-l(p,)]2- y'^) 
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The disorder is controlled solely by pc which is related only to the type of network, and A 
cannot take on large values. Thus, also for the Gaussian P(r, a), all optimal paths are in the 
weak disorder regime. Similar considerations lead to the same conclusion for the exponential 
distribution where A = — r/f''^'^^t~f-!'^ — r- Simulation results for the Gaussian distribution 
shown in Fig. El display only weak disorder (i.e. no weak-strong disorder crossover), thus 
supporting the above conclusions. 

To summarize, in this section we presented a criterion for the inverse disorder strength 
Z on the optimal path in weighted networks for general distributions P{T,a). We show an 
analytical expression, Eq. (jfilj) . which fully characterizes the behavior of the optimal path. 
Simulation of several distributions support these analytical predictions. It is plausible that 
the criterion of Eq. (jfilll is valid also for other physical properties in weighted networks — 
such as conductivity and flow in random resistor networks — due to a recently-found close 



relation between the optimal path and flow 
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VII. SCALING OF OPTIMAL-PATH-LENGTHS DISTRIBUTION WITH FINITE 
DISORDER IN COMPLEX NETWORKS 

In this chapter we present further support for the general analytical results pre- 
sented in section IVIB . The question is how the different optimal paths in a network are 
distributed? The distribution of the optimal path lengths is especially important in commu- 
nication networks, in which the overall network performance depends on the different path 
lengths between all nodes of the network, and not only the average. Ref. [47 1 studied the 
probability distribution P{iopt) of optimal path lengths in an ER network in the SD limit. 
The scaled curve for P(£opt) for different network sizes is shown in Fig. ^1 in a log-log plot. 
We find that similarly to the behavior of self avoiding walks [3] there are two regimes in 
this distribution, the first one being a power law P{iopt) ~ (^opt)^ which is evident from the 
figure, with g ^ 2. The second regime is an exponential P(^opt) ~ e °p* where C is a 
constant and 6 is around 2. This leads us to the conjecture that the distribution may have 
a Maxwellian functional form: 
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Where lo = V^(^opt)/2 is the most probable value of £opt- The solid line in the figure is the 
plot of this function and as seen it agrees with our numerical results. 

The exponents g and 6 can be obtained from the following heuristic arguments. The 
right tail of the distribution P(£opt) is determined by the distribution of the IIC size in 
the network of nodes. At percolation threshold (Sec. IVIIl . N nodes are divided into N/2 
clusters, obeying the power law distribution. However, the sum of all the cluster sizes is equal 
to A^, thus the distribution of the largest cluster sizes must have a finite size exponential 
cutoff P{S) ~ exp{—CS), as for the distribution of the segments of an interval divided by 
random partitions. Since S ~ i^U^ have S = de. 

To find the left tail distribution, we use the concept of MST. The chemical diameter of 
the MST is £opt while its mass is A^ ~ ^opt°^* • Due to self-similarity of the MST the number 
of nodes n{i) within a chemical distance i also scales as n {£) ~ ^i/'^opt. Thus the probabihty 
density of the of the optimal path for small values of i scales as dn{£)/d£ = f^/'^opt-i^ Hence 
g = l/t'opt — 1- We expect that our conjecture is valid also for SF networks. Using Eqs. (j37| 
and (|HH| we have: 

f 2, A > 4, ER 

S = de={ , 73 

[ (A-2)/(A-3), 3< A<4 

and 

2, A > 4, ER 

(74) 

2/(A-3), 3 < A < 4 

A recent work has studied the distribution form of shortest path lengths on minimum 

n 

spanning trees |40|| , which corresponds to optimal paths on networks with large variation in 
link weights (a — > oo). 

Using the scaling derived in Sect. IVIl more precisely: 

iia) ~ i^F (^] , (75) 

where pc is the percolation threshold and £oo ~ N'^°^^ is the optimal path length for strong 
disorder (a oo). For Erdos-Renyi (ER) graphs z/opt = 1/3- We generalize these results 
and suggest that the distribution of the optimal path lengths has the following scaling form: 

P(4pt,iV,a)~-^Gf^,iM. (76) 



-oo 



Pc a 
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FIG. 15: Scaled curve for the probability distribution P(£opt) of optimal path lengths for network 
sizes N = 1024,2048,4096,8192,16384,32768,65536. The gray curve represents Maxwellian fit 
given by Eq. ^ (After y )• 



The parameter Z = — which is equivalent to Z in Eq. (jU^ . determines the functional 
form of the distribution. Relation (fTBj) is supported by simulations Q for both ER and SF 
graphs, including SF graphs with 2 < A < 3, for which pc — with system size 

We simulate ER graphs with weights on the links for different values of graph size A^, 
control parameter a, and average degree {k) (which determines Pc = We then 

generate the shortest path tree (SPT) using Dijkstra's algorithm (See Section 111 B|) from 
some randomly chosen root node. Next, we calculate the probability distribution function 
of the optimal path lengths for all nodes in the graph 44 1 . 

In Fig. [iniwe plot iooP{iopt, N,a) vs. ^opt/^oo for different values of A^, a, and (k). A 
collapse of the curves is seen for all graphs with the same value of Z = 

Figure El shows similar plots for SF graphs - with a degree distribution of the form 
P{k) ~ k^'^ and with a minimal degree m. Scale-free graphs were generated according to 
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85|. A collapse 



the "configuration model" or Molloy Reed algorithm (See Section ITTK) 
is obtained for different values of N, a, A and m, with A > 3 and for the same values of Z. 
Next, we study SF networks with 2 < A < 3. In this regime the second moment of the 



degree distribution {k"^) diverges, leading to several anomalous properties 



For 



example: the percolation threshold approaches zero with system size according to Eq. ()25j) : 

3-A 

Pc ^ N ^-1 — > 0, and the optimal path length ioo in SD was found numerically to scale 
logarithmically with compared to polynomially, found in A > 3 j^^. Nevertheless, it 
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FIG. 16: Optimal path lengths distribution, P(^opt)) for ER networks with (a,b) Z = 



10 



and (c,d) Z = 3. (a) and (c) represent the unsealed distributions for Z = W and Z = 3 respectively, 
while (b) and (d) are the scaled distribution. Different symbols represent networks with different 
characteristics such as size = 2000, 4000, 8000 (which determines £ average degree 

(k) = 3,5,8 (which determines pc = l/(fc)), and disorder strength a = £oo/{pcZ) Results were 



averaged over 1500 realizations (After 



seems from Fig. that the optimal paths probability distribution for SF networks with 
2 < A < 3 exhibits similar collapse for different values of N and a for the same Z (although 
its functional form is different compared to the A > 3 case) l44l . 

n 

We present evidence that the optimal path is related to percolation |47(. The numerical 
results suggest that for a finite disorder parameter a, the optimal path (on average) follows 
the percolation cluster in the network (i.e., links with weight below pc) up to a typical 
"characteristic length" ^ = apc, before deviating and making a "shortcut" (i.e. crossing 
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FIG. 17: Optimal path lengths distribution, P{1), for SF networks with (a,b) Z = = 10 and 
(c,d) Z = 2. (a) and (c) represent the unsealed distributions for Z = W and Z = 2 respectively, 
while (b) and (d) are the scaled distribution. Different symbols represent networks with different 
characteristics such as size N = 4000,8000 (which determines ioo ~ A^'^°f"), A = 3.5,5 and m = 
2 (which determine Pr), and disorder strength a = loo/iPcZ)- Results were averaged over 250 
realizations (After |4J|). 



a link with weight above pc)- For length scales below ^ the optimal path behaves as in 
strong disorder and its length is relatively long. The shortcuts have an effect of shortening 
the optimal path length from a polynomial to logarithmic form according to the universal 
function F{u) (Eq. (f7^). Thus, the optimal path for finite a can be viewed as consisting 
of "blobs" of size ^ in which strong disorder persists. These blobs are interconnected by 
shortcuts, which result in the total path being in weak disorder. 

We next present direct simulations supporting this argument [4^. We calculate the 
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FIG. 18: Optimal path lengths distribution function for SF graphs with A = 2.5, m = 2 and 
with Z = = 10. (a) represents the unsealed distribution for Z = 10 while (b) shows the 

scaled distribution. Different symbols represent graphs with different characteristics such as size 
N = 2000, 4000, 8000, 1600 (which determines too ~ log(iV) and pc ~ iV~^/^), and disorder strength 
a = ^oo/{PcZ). Results were averaged over 1500 realizations (After 



optimal path length i{a) inside a single network of size A^, for a given a, and find (Fig. IT^ 
that it scales differently below and above the characteristic length ^ = apc- For each node in 
the graph we find imin, which is the number of links ( "hop count s" ) along the shortest path 
from the root to this node without regarding the weight of the link. 

In Fig. El we plot the length of the optimal path i{a), averaged over all nodes with the 
same value of imin for different values of a. The figure strongly suggests that l{a) ~ exp(£mm) 
for length scales below the characteristic length ^ = apc (see the linear regime in Fig. 119b). 
while for large length scales i{a) ~ imin- 

For length scales smaller than ^ we have iopt = AN^^^ and imin = BlnN, where A and 
B are constants. Thus = exp {imin/ B) and iopt = Aexp (Imin/'^B). Consequently, we 
expect that: ^ = ^^^pC^-^/^g) = ^exp [{Uin - 3B\n^)/3B]. We find the best scaling in 
Fig.mfoTB = ^. 

This is consistent with our hypothesis that below the characteristic length (^ = apc) 
imin ~ logA^ and l{a) ~ N^/'^, while imin ~ logA^ and /(a) ~ logA^ above. 

In order to better understand why the distributions of iopt depend on Z according to 
Eq. (fTUI) . we suggest the following argument. The optimal path for a — > cxd, was shown to 
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FIG. 19: Transition between different scaling regimes for the optimal path length /(a) inside an 
ER graph with = 128, 000 nodes and {k) = 10. (a) shows the unsealed and (b) shows the scaled 
length of the optimal path l{a) averaged over all nodes with same value of imin- Different symbols 
represent different values of the disorder strength a. Fig. (b) shows that for length scales i{a) 
smaller than the "characteristic length", ^ = apc, l{a) grows exponentially relative to the shortest 
hopcount path imin (see solid line). This is consistent with l{a) ~ N^^^ and imin ~ logiV inside 
the range of size ^ = apc- For length scales above ^ both quantities scale as log A^. (After ^44]). 



be proportional to N'^^^ for ER graphs and A^'-'*' ^^^^"^ for SF graphs with 3 < A < 4 j32j |. 
For finite a the number of shortcuts, or number of blobs, is Z = ^ = —. The deviation of 
the optimal path length for finite a from the case of a ^ oo is a function of the number of 
shortcuts. These results explain why the parameter Z = ^ determines the functional form 
of the distribution function of the optimal paths (see also section IVIB ) . 

To summarize, we have shown that the optimal path length distribution in weighted 
random graphs has a universal scaling form according to Eq. (jTU)). We explain this behavior 
and demonstrate the transition between polynomial and logarithmic behavior of the average 
optimal path in a single graph. Our results are consistent with results found for finite 



dimensional systems 
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68[ : In finite dimension the parameter controlling the transition 



is Z 



where L is the system length and v is the correlation length critical exponent 



as in Eq.(jnS)). This is because only the "red bonds" - bonds that if cut would disconnect 
the percolation cluster |78| - control the transition (see also section IVlt B). 
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VIII. SCALE-FREE NETWORKS EMERGING FROM WEIGHTED RANDOM 
GRAPHS 



In this section we introduce a simple process that generates random scale-free networks 
with A = 2.5 from weighted Erdos-Renyi graphs T^]- We further show that the minimum 
spanning tree (MST) on an Erdos-Renyi graph is related to this network, and is composed 
of percolation clusters, which we regard as "super nodes", interconnected by a scale-free 
tree. We will see that due to optimization this scale-free tree is dominated by links having 
high weights — significantly higher than the percolation threshold pc- Hence, the MST 
naturally distinguishes between links below and above the percolation threshold, leading to 
a scale-free "supernode network". Our results may explain the origin of scale-free degree 
distribution in some real world networks. 

Consider an Erdos-Renyi (ER) graph with nodes and an average degree (k), thus 
having a total of N{k)/2 links. To each link we assign a weight chosen randomly and 
uniformly from the range [0, 1]. We define black links to be those links with weights below a 
threshold pc = Two nodes belong to the same cluster if they are connected by black 

links [Fig.EHta)]. 

From Section IVII follows that the number of clusters of s nodes scales as a power law. 
Us ~ s^^, with r = 2.5 for ER networks. Next, we merge all nodes inside each cluster into 
a single "supernode". We define a new "supernode network" [Fig. I^OT b)] of A^sn supernodes 
[4^. The links between two supernodes [see Figs. I^HT a) and l^nT b)] have weights larger than 
Pc- The degree distribution P{k) of the supernode network can be obtained as follows. Every 
node in a supernode has the same (finite) probability to be connected to a node outside the 
supernode. Thus, we assume that the degree k of each supernode is proportional to the 
cluster size s, which obeys rig ~ Hence P{k) ~ k^'^, with A = 2.5, as supported by 

simulations shown in Fig. |^ Furthermore, we also see that if the threshold for obtaining 
the clusters which are merged into supernodes is changed slightly, the degree distribution 
still remains scale free with A = 2.5, but with an exponential cutoff. This is an indication 
of the fact that there are still supernodes of high degree which are connected to many other 
(small) supernodes by links with weights significantly higher than pc] if this was not the 
case, a small change in the threshold would cause many clusters to merge and destroy the 
power law in the supernode network degree distribution. 
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(a) Original Networl< 



(c) IVIST of original Network 




FIG. 20: Sketch of the "supernode network", (a) The original ER network, partitioned into 
percolation clusters whose sizes s are power-law distributed, with Us ~ s~'^ where r = 2.5 for ER 
graphs. The "black" links are the links with weights below pc, the "dotted" links are the links 
that are removed by the bombing algorithm, and the "gray" links are the links whose removal will 
disconnect the network (and therefore are not removed even though their weight is above pc)- (b) 
The "supernode network": the nodes are the clusters in the original network and the links are 
the links connecting nodes in different clusters (i.e., "dotted" and "gray" links). The supernode 
network is scale-free with P{k) ~ k^'^ and A = 2.5. Notice the existence of self loops and of 
double connections between the same two supernodes. (c) The minimum spanning tree (MST), 
composed of black and gray links only, (d) The MST of the supernode network ("gray tree"), 
which is obtained by bombing the supernode network (thereby removing the "dotted" links), or 
equivalently, by merging the clusters in the MST to supernodes. The gray tree is scale-free, with 
A = 2.5 (After Q)- 



We next show that the MST on an ER graph is related to the supernode network, and 
therefore also exhibits scale-free properties. In the MST each path between two sites on the 
MST is the optimal path in the "strong disorder" limit Q, 1^, meaning that along this 
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FIG. 21: The degree distribution of the supernode network of Fig. 1(b), where the supernodes 
are the percolation clusters, and the links are the links with weights larger than pc (O)- The 
distribution exhibits a scale-free tail with A ~ 2.5. If we choose a threshold less than pc, we obtain 
the same power law degree distribution with an exponential cutoff. The different symbols represent 
slightly different threshold values: pc — 0.03 (□) and pc — 0-05 (A). The original ER network has 
A'" = 50, 000 and (k) = 5. Note that for k ~ (k) the degree distribution has a maximum (After 
0). 



path the maximum barrier (weight) is the smallest possible l^- 

Here we use the bombing algorithm (See Section Hit) ). If the removal of a link disconnects 
the graph, we restore the link and mark it "gray" ; otherwise the link [shown dotted in 
Fig. EHfa)] is removed. The links that are not bombed are marked as "black". In the 



bombing algorithm^on 



y links that close a loop can be removed. Because below criticality 



loops are negligible for ER networks {d —>■ oo), bombing does not modify the percolation 
clusters — where the links are black and have weights below pc- Thus, bombing modifies 
only links outside the clusters, so actually it is only the links of the supernode network that 
are bombed. Hence the MST resulting from bombing is composed of percolation clusters 
(composed of black links) and connected by gray links [Fig. I^UT c)]. 

From the MST of Fig. EHfc) we now generate a new tree, the MST of the supernode 
network, which we call the "gray tree", whose nodes are the supernodes and whose links 
are the gray links connecting them [see Fig. EOfd)]. Note that bombing the original ER 
network to obtain the MST of Fig. EOfc) is equivalent to bombing the supernode network 
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FIG. 22: (a) The degree distribution of the "gray tree" (the MST of the supernode network, shown 
in Fig. l2Ur d)'). in which the supernodes are percolation clusters and the links are the gray links. 
Different symbols represent different threshold values: pc (O); Pc + 0.01 (□) and pc + 0.02 (A). 
The distribution exhibits a scale- free tail with A ~ 2.5, and is relatively insensitive to changes in 
Pc- (b) The average path length ^gray on a the gray tree as a function of original network size. It 
is seen that Vay ~ logiVsn ~ log iV (After 0). 



of Fig. |2ntb) to obtain the gray tree, because the links inside the clusters are not bombed. 
We find [Fig|22ta)] that the gray tree has also a scale-free degree distribution P(/c), with 
A = 2.5 — the same as the supernode network jSC 
length £gray scales as £gray ~ log ~ log j4j 



. We also find [Fig. I22r b)] the average path 
, 0]. Note that even though the gray tree 



is scale- free, it is not ultra-small [7[, since the length does not scale as log log A^. 

Next we show that the bombing optimization, which leads to the MST, yields a significant 
separation between the weights of the links inside the supernodes and the links connecting 
the supernodes. As explained above, the MST is optimal in two senses: (i) the total weight 
of all links is minimal (ii) any path between any two nodes on the MST will encounter the 
smallest maximal harrier (weight) between these nodes. The last property is common to 
many physical systems (e.g. the protein folding network - see below). Accordingly, we study 
the weights encountered when traveling along a typical path on the MST. 

We consider all pairs of nodes in the original MST of A^ nodes [Fig. EHtc)] and calculate 
the typical path length £typ, which is the most probable path length on the MST. For each 
path of length £typ we rank the weights on its links in descending order. For the largest 
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weights ("rank 1 links"), we calculate the average weight Wr=i over all paths. Similarly, for 
the next largest weights ("rank 2 links") we find the average Wr=2 over all paths, and so 
on up to T — ^typ- Fi§- I23f a) shows as a function of rank r for three different network 
sizes = 2000, 8000, and 32000. It can be seen that weights below (black links inside 
the supernodes) are uniformly distributed and approach one another as increases. As 
opposed to this, weights above pc ("gray links") are not uniformly distributed, due to the 
bombing algorithm, and are independent of A^. Actually, weights above pc encountered along 
the optimal path (such as the largest weights Wi, W2 and w^) are significantly higher than 
those below pc- Fig. ESfb) shows that the links with the highest weights on the MST can 
be associated with gray links from very small clusters [Figs. EUT a) and OUT c)] (similar results 
have been obtained along the optimal path). 

As mentioned earlier, this property is present also in the original supernode network and 
hence the change in the threshold used to obtain the supernodes does not destroy the power 
law degree distribution but only introduces an exponential cutoff. We thereby obtain a scale- 
free supernode network with A = 2.5, which is not very sensitive to the precise value of the 
threshold used for defining the supernodes. For example, the scale-free degree distribution 
shown in Fig. l^^ a) for a threshold of Pc+0.01 corresponds to having only four largest weights 
on the optimal paths [see Fig. |2Sta)]. However, even for pc + 0.02 the degree distribution is 
well approximated by a scale-free distribution with A = 2.5 [see Fig.|22ta)]. This means that 
mainly very small clusters, connected with high-weight links to large clusters, dominate the 
scale-free distribution P{k) of the MST of the supernode network (gray tree). Hence, the 
bombing optimization process on an ER graph causes a significant separation between links 
below and above Pc to emerge spontaneously in the system, and by merging nodes connected 
with links of low weights, a scale-free network can arise. 

The process described above may be related to the evolution of some real world networks. 
Consider a homogeneous network with many components whose average degree (k) is well 
defined. Suppose that the links between the components have different weights, and that 
some optimization process separates the network into nodes which are well connected (i.e., 
connected by links with low weights) and nodes connected by links having much higher 
weights. If the well-connected components merge into a single node, this results in a new 
heterogeneous supernode network with scale free degree distribution. 

An example of a real world network whose evolution may be related to this model is the 
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protein folding network, which was found to be scale-free with A ~ 2.3 5^- The nodes 
are the possible physical configurations of the system and the links between them describe 
the possible transitions between the different configurations. We assume that this network 
is optimal because the system chooses the path with the smallest energy barrier from all 
possible trajectories in phase space. It is possible that the scale-free distribution evolves 
through a similar procedure as described above for random graphs: adjacent configurations 
with close energies (nodes in the same cluster) cannot be distinguished and are regarded as 
a single supernode, while configurations (clusters) with high barriers between them belong 
to different supernodes. 

A second example is computer networks. Strongly interacting computers (such as com- 
puters belonging to researchers from the same company or research institution) are likely 
to converge into a single domain, and thus domains with various sizes and connectivities 
are formed. This network might be also optimal, because packets destined to an external 
domain are presumably routed through the router which has the best connection to the 
target domain. 

To summarize, we have seen that any weighted random network hides an inherent scale- 
free "supernode network" js^. We showed that the minimum spanning tree, generated by 
the bombing algorithm, is composed of percolation clusters connected by a scale-free tree 
of "gray" links. Most of the gray links connect small clusters to large ones, thus having 
weights well above the percolation threshold that do not change with the original size of the 
network. Thus the optimization in the process of building the MST distinguishes between 
links with weights below and above the threshold, leading to a spontaneous emergence of 
a scale-free "supernode network". We raise the possibility that in some naturally optimal 
real-world networks, nodes connected well merge into one single node, and thus a scale-free 
network emerges. 



IX. PARTITION OF THE MINIMUM SPANNING TREE INTO SUPERHIGH- 
WAYS AND ROADS 

The centrality, C, quantifies the "importance" of a node for transport in the network. 
Moreover, identifying the nodes with high C enables, as shown below, to improve their 
transport capacity and thus improve the global transport in the network. Several definitions 
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FIG. 23: (a) The average weights Wr along the optimal path of an ER graph with (k) = 5, 
sorted according to their rank. Different symbols represent different system sizes: = 2000 (O); 
N = 8000 (□) and N = 32000 (A). Below pc = 0.2, the weights are uniformly distributed, while 
weights above pc are significantly higher and independent of A^. (b) Cluster size vs. the minimal 
gray link emerging from each cluster, for ER graphs with (k) = 5 and = 10000. Small clusters 
are associated with higher weights because they have a small number of exits and thus cannot be 



optimized (After |77| ) . 



of centrality exists. Here we deal with the "betweeness centrahty" which is defined as the 



relative number of shortest path in the network passing through a node (or a link), 
probability density function (pdf) of C was studied on the MST for both scale-free (SF) 
and Erdos-Renyi (ER) 0, |^ networks and found to satisfy a power law, 



^'he 



Pmst(C) ~ C-^-sT (77) 

with (5mst close to 2 Q, Q|- An important question is weather there are substructure of 
the MST which are more central and play a major role on the transport. Reference jool 
shows that a sub-network of the MST, the infinite incipient percolation cluster (IIC) has a 
significantly higher average C than the entire MST [59| — i.e., the set of nodes inside the 
IIC are typically used by transport paths more often than other nodes in the MST jo^. 
— In this sense the IIC can be viewed as a set of superhighways (SHW) in the MST. The 
nodes on the MST which are not in the IIC are called roads, due to their analogy with roads 
of less traffic (usually used by local residents). We demonstrate the impact of this finding 
by showing that improving the capacity of the superhighways (IIC) is significantly a better 
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strategy to enhance global transport compared to improving the same number of links with 



the highest C in the MST, although they have higher C j60|. This counterintuitive result 
shows the advantage of identifying the IIC subsystem, which is very small compared to the 
full network 63] . These results are based on extensive numerical studies for centrality of the 



IIC, and comparison with the centrality of the entire MST j55|, |60|, as described below. ER 
and SF network of size are generated by the methods explained in section Hi Al Multiple 
connections between two nodes and self-loops in a single node are disallowed. To construct 



3etween 
see 



JCLV 

4l| 



a weighted network, a weight Wi is assigned to each link from a uniform distribution 
and 1. The MST is obtained from the weighted network using Prim's algorithm 
Section [IIE|) . Once the MST is built, one can compute the value of C of each node by 
counting the number of paths between all possible pairs passing through that node and 
normalize C by the total number of pairs in the MST, N{N — l)/2, which ensures that 
C is between and 1 |6^. The IIC of ER and SF networks is simulated as explained in 
Section ITTfI 

To quantitatively study the centrality of the nodes in the IIC, we calculate the pdf, 
ViiciC) of C. Figure 1211 shows that for all three cases studied, ER, SF and square lattice 
networks, 'Piic(C*) for nodes satisfies a power law 

Piic(C^) ~ (78) 

where 



>iic 



1.2 [ER,SF] ^ ^ 

^ ' ^ . (79) 

1.25 [square lattice] 



Moreover, from Fig.|211 it is seen that 5nc < ^msti implying a larger probability to find a 
larger value of C in the IIC compared to the entire MST. The values for (5mst are consistent 
with those found in Ref. 



58| . Similar results for the centrality of the links were obtained. 
The results thus show that the IIC is like a network of superhighways inside the MST. When 
we analyze centrality of the entire MST, the effect of the high C of the IIC is not seen since 
the IIC is only a tiny fraction of the MST. Some results are summarized in Table HTl 

The values of 5mst and (5nc can be understood from the following scaling arguments, based 



25 



to 



self-similarity properties of the MST and the IIC. Similar arguments are used in 
express exponent describing the cluster size distribution at percolation threshold on a 
lattice in terms of the cluster fractal dimension and the dimension of the lattice. Indeed, 
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the majority of nodes are connected through the superhighway hnks, whose centrahty is 



proportiona 



to N"^. The number of these hnks for the entire network scales as ioo{N) 



pt 



^i/i^opt ^ Thus small regions of the MST of chemical diameter i consists of n 
nodes. These regions have length £00 (^) ~ n^/'^°p'. These roads connect the nodes in this 
region with the rest of the nodes of the MST, and thus their centrality is at least nN. The 
total number of such links in all the regions of size n is m(n) = ioo{'n)N/n ~ n'^°p^~^. This 
is the number of links with centrality larger than nN. Thus number of links with centrality 



(80) 



exactly nN is m(n) — m{n + 1) ~ n'^°p* ^ = n "^mst^ Using Eq. ()38|) we have: 

^ _ f 5/3, A > 4, ER 

omst — \ 

[ (A+1)/(A-1), 3< A<4 

Similar arguments lead to the centrality distribution of the nodes on the IIC. The small 
regions of chemical diameter i have centrality larger or equal to Nn{i). The number of links 
in the IIC belonging to these regions is s{i) = i'^'^ jS]. The total number of such regions is 
S/s{i), thus the total number of the links of the IIC with centrality larger than Nn{i) is 
m[n{i)] = i/s{i) ~ i^-'^' = n^^-'^^) Accordingly 



Sue = l + {di - 



opt 



(81) 



4/3, A > 4, ER 
A/(A- 1), 3 < A < 4 

The values predicted by Eqs. (jHUj) and (|HT|) are in good agreement with the simulation 
results presented in Table ITTl 

To further demonstrate the significance of the IIC, we compute the average (C) for each 
realization of the network over all nodes. Fig. |23 shows the histograms of (C) for both the 
IIC and for the other nodes on the MST. We see that the nodes on the IIC have significantly 
larger (C) compared to the other nodes of the MST. 
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SF (A = 4.5) 


SF (A = 3.5) 
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1.32 


Z^opt 


1/3 


1/3 


0.2 


0.61 


{u) 


0.29 


0.20 


0.13 


0.64 



TABLE II: Results for the IIC and the MST (After 
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FIG. 24: The pdf of the centrahty C of nodes for (a) ER graph with (k) = 4, (b) SF with A = 4.5, 
(c) SF with A = 3.5 and (d) 90 x 90 square lattice. For ER and SF, N = 8192 and for the square 
lattice N = 8100 . We analyze 10"^ realizations. For each graph, the full circles show 'Piic(C); the 



empty circles show Vmst{C) (After 



60|). 



Figure |2S1 shows a schematic plot of the SHW inside the MST and demonstrates its use 
by the path between pairs of nodes. The MST is a "skeleton" subset of links inside the 
network, which plays a key role in transport between the nodes. However, the IIC in the 
MST is like the "spine in the skeleton", which plays the role of the superhighways inside a 
road transportation system. To illustrate our result a car can drive from the entry node A 
on roads until it reaches a superhighway, and finds the exit which is closest to node B. Thus 
those nodes which are far from each other in the MST use the IIC superhighways more than 
those nodes which are close to each other. In order to demonstrate this, we compute /, the 
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FIG. 25: Schematic graph of the network of connected superhighways (heavy hnes) inside the MST 
(shaded). A, B and C are examples of possible entry and exit nodes, which connect to the network 



of superhighways by "roads" (thin lines). The midd^ 
with much smaller size compared to the IIC (After 



e size lines indicate other percolation clusters 



6^ 



average fraction of pairs of nodes using by the shortest paths the IIC, as a function of iusT, 
the distance between a pair of nodes on the MST (Fig. 1^ . We see that / increases and 
approaches one as £mst grows. We also show that / scales as £mrt/ ./V^°p* for different system 
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sizes, where z/opt is the percolation connectedness exponent 

The next question is how much the IIC is used in transport on the MST? We define the 
IIC superhighway usage, 

(82) 



u = 



?MST ' 



where iuc is the number of links in a given path of length ^mst belonging to the IIC 
superhighways. The average usage {u) quantifies what fraction of nodes/links of the IIC is 
used by the transport between all pairs of nodes. In Fig. EHfa), we show (u) as a function of 
the system size A^. Our results suggest that (u) approaches a constant value and becomes 
independent of A^ for large A^. This is surprising since the average value of the ratio between 
the number of nodes on the IIC and on the MST, (A'hc/^mst), approaches zero as A^ — *• 



oo 



63| . showing that although the IIC contains only a tiny fraction of the nodes in the 
entire network, its usage for the transport in the entire network is constant. We find that 
(u) ~ 0.3 for ER networks, (u) ~ 0.2 for SF networks with A = 4.5, and (u) ~ 0.64 for 
the square lattice. The reason why {u) is not close to 1.0 is that in addition to the IIC, 



59 




20 



FIG. 26: The average fraction, (/), of pairs using the SHW, as a function of Imst, the distance 
between the pair on the MST. (a) ER graph with (A;) = 4, (b) SF with A = 4.5, (c) SF with A = 3.5 
and (d) square lattice. For ER and SF: (O)^ = 1024 and {a)N = 2048 with 10^ reahzations. For 
square lattice: (O)^ = 1024 and {a)N = 2500 with 10^ realizations. The X axis is rescaled by 
N^opt^ where u^ nt = 1/3 for ER and for SF with A > 4, and Uopt = (A - 3)/A - 1) for SF networks 
with 3 < A < 4 [321 . For the Lx L square lattice, ^mst ~ L'^°p^ with dopt = 1-22 and since = N, 



i^opt = c^opt/2 ~ 0.61 
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3^ 



(After 
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the optimal path passes also through other percolation clusters, such as the second largest 
and the third largest percolation clusters. In Fig. |2H1 "we also show for ER networks, the 
average usage of the two largest and the three largest percolation clusters for a path on the 
MST and we see that the average usage increases significantly and is also independent of A^. 
However, the number of clusters used by a path on MST is relatively small and proportional 
to In N j47|, suggesting that the path on the MST uses only few percolation clusters and few 
jumps between them (of order InA^) when traveling from an entry node to an exit node on 
the network. When N —>■ oo the average usage of all percolation clusters should approach 1. 
Can we use the above results to improve the transport in networks? It is clear that by 
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FIG. 27: The normalized pdf for superhighway and roads of (C), the C averaged over all nodes 
in one realization, (a) ER network, (b) SF network with A = 4.5, (c) SF network with A = 3.5 



and (d^ 
(After 



^uare lattice network. To make each histogram, we analyze 1000 network configurations 

m 



improving the capacity or conductivity of the highest C links one can improve the transport 
(see Fig. OHT b) inset). We hypothesize that improving the IIC links (strategy I), which 
represent the superhighways is more effective than improving the same number of links with 



the highest C in the MST (strategy II), although have higher centrality [6^. To test the 
hypothesis, we study two transport problems: (i) current flow in random resistor networks, 
where each link of the network represents a resistor and (ii) the maximum flow problem well 

n 

known in computer science We assign to each link of the network a resistance/capacity, 
e""^, where x is an uniform random number between and 1, with a = 40. The value of 
a is chosen such as to have a broad distribution of disorder so that the MST carries most 
of the flow 



471, 



68l |. We randomly choose n pairs of nodes as sources and other n nodes 
as sinks and compute the flow between them. We compare the transport by improving the 
conductance/capacity of all links on the IIC (strategy I) with that by improving the same 
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FIG. 28: (a) The average usage (u) = (^iic/^mst) for different networks, as a function of the 
number of nodes N. Q (ER with (A;) = 4), □ (SF with A = 4.5), O (SF with A = 3.5), A {L x L 
square lattice). The symbols ([>) and (<l) represent the average usage for ER with (k) = 4 when the 
two largest percolation clusters and the three largest percolation clusters are taken into account, 
respectively, (b) The ratio between the flow using strategy I, Fsj, and that using strategy II, Fsu, 
as a function of the factor of improving conductivity /capacity. The inset is the ratio between the 
flow using strategy I and the flow in the original network, Fq. The data are all for ER networks 
with N = 2048, {k) = 4 and n = 50(0), n = 250(0) and n = 500(n). The unfilled symbols are 
for current flow and the filled symbols are for maximum flow (After jsd])- 

number of links with the highest C in the MST (strategy II). Since the two sets are not 
the same and therefore higher centrality links will be improved in II 6J], it is tempting to 
suggest that the better strategy to improve the global flow is strategy II. However, here we 
demonstrate using ER networks as an example that counterintuitively strategy I is better. 
We also find similar advantage of strategy I compared to strategy II for SF networks with 
A = 3.5. 

In Fig. EHtb), we compute the ratio between the flow using strategy I (F^i) and the flow 
using strategy II (-Fsn) as a function of the factor of improving conductivity/capacity of the 
links. The figure clearly shows that strategy I is better than strategy II. Since the number 
of links in the IIC is relatively very small comparing to the number of links in the whole 
network [63i], it could proven to be a very efficient strategy. 
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In summary, we find that the centrahty of the IIC for transport in networks is significantly 
larger than the centrality of the other nodes in the MST. Thus the IIC is a key component 
for transport in the MST. We demonstrate that improving the capacity/conductance of the 
links in the IIC is useful strategy to improve transport. 



X. SUMMARY 



We reviewed recent studies on the scaling of the average optimal path length £opt in a 
disordered network. There are two scaling regimes of ^opt corresponding to the regimes of 
weak and strong disorder. For ER networks and SF networks with A > 4, £opt ~ InA^ in 
the weak disorder regime while £opt ~ N'^^^ in the strong disorder regime. For SF networks 
with 3 < A < 4, £opt ~ InA^ in the weak disorder regime while £opt ~ A^^-^ in the strong 
disorder regime. For SF networks with 2 < A < 3, £opt ~ InA^ in the weak disorder regime 
while iopt ~ ln'^~^ in the strong disorder regime. The scaling behavior of iopt in the 
strong disorder regime for ER and SF networks with A > 3 is obtained analytically using 
percolation theory fs^. For exponential disorder, for both ER random networks and SF 
networks we obtain a scaling function for the crossover from weak disorder characteristics to 
strong disorder characteristics. We show that the crossover occurs when the min-max path 
reaches a crossover length i*{a) and i*{a) ~ a. Equivalently, the crossover occurs when the 
network size reaches a crossover size N*{a), where N*{a) ~ for ER networks and for 

A-l 

SF networks with A > 4 and N*{a) ~ a^-a for SF networks with 3 < A < 4. 

We also have shown that the optimal path length distribution in weighted random graphs 
has a universal scaling form according to Eq. ()76|) . We explain this behavior and demonstrate 
the transition between polynomial to logarithmic behavior of the average optimal path in a 
single graph. 

Our results are consistent with results found for finite dimensional systems 
In finite dimension the parameter controlling the transition is where L is the system 
length and u is the correlation length critical exponent. This is because only the "red bonds" 
- bonds that if cut would disconnect the percolation cluster [t^ - control the transition. 

We also show that any weighted random network hides an inherent scale-free "supernode 
network" 8^. We showed that the minimum spanning tree, generated by the bombing 
algorithm, is composed of percolation clusters connected by a scale-free tree of "gray" links. 
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Most of the gray links connect small clusters to large ones, thus having weights well above the 
percolation threshold that do not change with the size of the network. Thus the optimization 
in the process of building the MST distinguishes between hnks with weights below and above 
the threshold, leading to a spontaneous emergence of a scale-free "supernode network" with 
A = 2.5. We raise the possibility that in some naturally optimal real- world networks, nodes 
connected well merge into one single node, and thus a scale-free network emerges. 

The centrahty in networks for transport on the MST is studied. We found that the 
centrahty of the nodes in the IIC is significantly larger than the centrality of the other nodes 
in the MST. The analytical estimation for the exponents of the centrality distribution for 
both the MST and the IIC are provided. Thus the IIC is a key component for transport in the 
MST. As a result of this finding, we demonstrated that improving the capacity/conductance 
of the links in the IIC is a useful strategy to improve transport which is a better strategy 
compare to improving the same number of links with the highest centrality in the MST. 
This is probably due to the global nature of transport which prefer global improvement of 
the superhighways rather than local improvement of high centrality links. 
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